DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE AND USES THEREOF 

This application is a Continuation-In-Part of U.S. Serial 
No. 08/657,620, filed May 30, 1996, the content of which 
is incorporated by reference into this application. 

Throughout this application, various publications are
referenced by Arabic numerals. Full citations for these
publications may be found listed at the end of the
specification. The disclosures of these publications in
their entireties are hereby incorporated by reference
into this application in order to more fully describe the
state of the art as known to those skilled therein.

Background of the Invention 

Cholesterol or related sterols, required for the
viability of eukaryotic cells, exist in the free form or
as esters conjugated to fatty acids. The concentration
of free sterol determines the fluidity of eukaryotic cell
membranes, whereas esterified sterols cannot participate
in membrane assembly. The esterification of
intracellular sterol, mediated in mammals by the
membrane-bound enzyme, acylcoenzyme A: cholesterol
acyltransferase, is thus a critical homeostatic
determinant of membrane function (1, 2). For example,
cholesterol depletion of the rough endoplasmic reticulum
(ER) relative to the smooth ER (3), may modulate protein
translocation or membrane-associated transcriptional
activators such as the Sterol Response Element Binding
proteins (SREBP, 4). In addition, production of
cholesterol ester (CE) by acylcoenzyme A: cholesterol
acyltransferase in the rough ER may influence the
transport of sterol between intracellular pools. Similar
esterification activities have been observed in other
eukaryotes such as plants and yeasts (5).

Elevations in acylcoenzyme A: cholesterol acyltransferase
activity perturb several pathways that contribute to
hyperlipidemia and atherosclerosis. Sterol
esterification modifies the activity of the low density
lipoprotein (LDL) receptor and alters serum lipoprotein
composition to be pro-atherogenic (6, 7). It may also be
a rate limiting step in intestinal sterol absorption (8).
Furthermore, CE deposition in the arterial wall is an
important initial step in atherogenesis (9). The
understanding of the acylcoenzyme A: cholesterol
acyltransferase reaction has been hampered by the
difficulty of biochemical purification and by a poor
grasp of the relevant genetic determinants. A human
acylcoenzyme A: cholesterol acyltransferase I gene from
macrophages was identified by complementation of Chinese
Hamster Ovary cell lines deficient in acylcoenzyme A:
cholesterol acyltransferase activity (10) and was
functionally expressed in insect cells devoid of
endogenous activity (11).



Summary of the Invention

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III.

This invention also provides a vector which includes the
isolated nucleic acid which encodes an acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III and a host vector system
which includes a vector.

This invention also provides a method of producing a
polypeptide which comprises growing such host vector
system of claim 14 under suitable conditions permitting
production of the polypeptide and recovering the
polypeptide so produced. This invention also provides a
purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides an oligonucleotide of at
least 15 nucleotides capable of specifically hybridizing
with a unique sequence of nucleotides present within a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III without hybridizing to a
nucleic acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III. This invention also
provides an oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic acid
which encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III without hybridizing to a nucleic acid
which encodes a wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol and for treating a subject who
has an imbalance in sterol levels due to a defect in
esterification of sterol.

This invention also provides methods for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III in
a subject.

This invention also provides a method for identifying a
chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III in a
subject and a pharmaceutical composition comprising
the chemical compound so identified.

This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III.



Brief Description of the Figures

Abbreviations: The amino acid residues are abbreviated
as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G,
Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P,
Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr. CON: consensus sequence.

Figures 1A and 1B. Protein sequence alignments predicted
from candidate genes for the human acylcoenzyme A:
cholesterol acyltransferase gene I, the yeast homologs,
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and acylcoenzyme A: cholesterol acyltransferase-
related enzyme II, and a consensus sequence of all three
sequences.

Identical residues between all the sequences
are in bold face. Residues of the candidate
leucine zipper heptad motif are italicized.
Potential transmembrane domains were identified
at residues 132 to 155 and 460 to 483; 186 to
202 and 406 to 421; and 215 to 231 and 439 to
451, for human acylcoenzyme A: cholesterol
acyltransferase (Sequence I.D. No.: 2),
acylcoenzyme A: cholesterol acyltransferase-
related enzyme I (Sequence I.D. No.: 4) and
acylcoenzyme A: cholesterol acyltransferase-
related enzyme II (Sequence I.D. No.: 6),
respectively. The firefly luciferase signature
sequences identified in human acylcoenzyme A:
cholesterol acyltransferase I (10) were not
conserved in the yeast genes. CON (Sequence
I.D. No.: 13) denotes the consensus sequence
between the sequences of human acylcoenzyme A:
cholesterol acyltransferase, acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
and acylcoenzyme A: cholesterol
acyltransferase-related enzyme II. R07932
denotes the partial sequence of another human
acylcoenzyme A: cholesterol acyltransferase
candidate cDNA (residues 500 to 600) (Sequence
I.D. No.: 14). The asterisks indicate the
residues in R07932 identical to those of the
other sequences.

1A. Alignment of amino acid residues 1-362 of
acylcoenzyme A: cholesterol acyltransferase-
related enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-
related enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.
1B. Alignment of amino acid residues 363-611 of
acylcoenzyme A: cholesterol acyltransferase-
related enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-
related enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.

Figures 2A, 2B, 2C, 2D and 2E. Construction and analysis
of acylcoenzyme A: cholesterol acyltransferase genes and
deletion mutants.

2A. The are1ΔNA deletion. The schematic depicts a
fragment from yeast chromosome III in plasmid
pH3(34). Strategic restriction endonucleases
are indicated (H, Hind III; B, Bam HI).
2B. The autoradiogram depicts Hind III digested DNA
from wild-type or disrupted diploid strains
probed with the 2993-bp Bam HI fragment. This
produced a fragment corresponding to the
wild-type acylcoenzyme A: cholesterol
acyltransferase-related enzyme I locus and a
1984-bp fragment characterizing the are1ΔNA
allele. The diploid is heterozygous for the
acylcoenzyme A: cholesterol acyltransferase-
related enzyme I deletion.

2C. Reduced stringency hybridization of yeast
genomic DNA with acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding
sequences. Genomic DNA from wild-type or
ARE1/are1ΔNA diploids was reprobed with an Nhe
I-Avr II fragment corresponding to the
acylcoenzyme A: cholesterol acyltransferase-
related enzyme I open reading frame ("ORF").
Hybridizations and washes were performed at
60°C in the absence of formamide.
2D. The are2Δ deletion. In step 1, PCR amplifying
oligonucleotides, KO-5' and KO-3', and a LEU2
template were used to produce the selectable
yeast gene flanked at the 5' and 3' ends by
acylcoenzyme A: cholesterol acyltransferase-
related enzyme II. In step 2, this was used to
direct homologous recombination at acylcoenzyme
A: cholesterol acyltransferase-related enzyme
II by transformation of a diploid strain and
selection for leucine prototrophy. In step 3,
integrants to acylcoenzyme A: cholesterol
acyltransferase-related enzyme II were
identified by a PCR reaction using
oligonucleotides flanking ARE2 (are2-5' and
are2-3') and a 3' amplimer within LEU2 (L2-3').

2E. A 999-bp fragment identifies are2Δ, as shown in
the ethidium bromide stained agarose gel. The
wild-type fragment (2206-bp) is also produced
in the same reaction. Leucine prototrophic
transformants with deletions of acylcoenzyme A:
cholesterol acyltransferase-related enzyme II
were obtained at a frequency of ~2%. M
indicates the 50-2,000-bp ladder markers
(Bio-Rad Laboratories).

Figures 3A and 3B. Fluorescent staining of triglyceride 
and sterol ester. 

The cells were grown in YEPD to stationary
phase, washed with deionized H2O, and incubated
with 1 μg/ml Nile Red (1 mg/ml in acetone).
Fluorescent images were obtained with a BioRad
MRC600 laser scanning confocal microscope
(BioRad Microscience, Hercules, CA) on an
inverted Zeiss Axiovert microscope (Zeiss,
Oberkochen, Germany) using a 63X (NA 1.4) Zeiss
Plan-apo infinity corrected objective. Samples
were illuminated with the 488 nm line from an
argon ion laser and the fluorescence was
visualized with a 540 nm dichroic mirror and
550 nm long-pass emission filter. Staining of
the cytoplasmic lipid droplets was sensitive to
treatment with isopropanol, proving them to be
lipid in nature.

3A. Wild-type cells. 

3B. are1ΔNA are2Δ double mutant cells.

Figures 4A, 4B, 4C and 4D. Neutral lipid and sterol
biosynthesis in ARE deletion mutants.

Strain genotypes are as described in the text; 
dpm/mg dry weight: disintegrations per minute 
per milligram of dry weight of cells. 
4A. Triglyceride biosynthesis. Total lipids were
extracted from cells grown in media containing
3H-oleate and analyzed by thin-layer
chromatography.
4B. Sterol ester biosynthesis. Total lipids were
extracted from cells grown in media containing
3H-oleate and analyzed by thin-layer
chromatography.
4C. Sterol ester biosynthesis in wild-type and
mutant cells transformed with vector control
(black box) or acylcoenzyme A: cholesterol
acyltransferase-related enzyme I
over-expression plasmids, YEp3-16 (increased
copy number, shaded box) and pADH5-36
(transcription from the ADH promoter, open
boxes). Cells were grown in selective media to
maintain the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I expression
plasmids. Lipids were labeled, extracted and
analyzed as above.

4D. Sterol biosynthesis in acylcoenzyme A:
cholesterol acyltransferase-related enzyme
deletion mutants. Lipids were labeled in
synthetic complete media containing [1-14C]
acetate, saponified and extracted with hexane
and subjected to thin layer chromatography
analysis. The data are representative of three
separate experiments and are expressed as the ratio
of incorporation into sterols to incorporation
into fatty acids.

Figures 5A, 5B, 5C, 5D, 5E and 5F. The nucleic acid and 
amino acid or predicted amino acid sequences. 

5A-1 - 5A-3. 

The nucleic acid sequence of human acylcoenzyme
A: cholesterol acyltransferase I designated
Sequence ID No.: 1. The amino acid sequence of
human acylcoenzyme A: cholesterol
acyltransferase I designated Sequence ID No.:
2.

5A-1. Nucleic acid sequence of human 

acylcoenzyme A: cholesterol 
acyltransferase I from nucleic acid 
bases 1-1624. Amino acid sequence of 
human acylcoenzyme A: cholesterol 
acyltransferase I from amino acid 
residues 1-76. 

5A-2. Nucleic acid sequence of human
acylcoenzyme A: cholesterol 
acyltransferase I from nucleic acid 
bases 1625-2524. Amino acid sequence 
of human acylcoenzyme A: cholesterol 
acyltransferase I from amino acid 
residues 77-376. 

5A-3. Nucleic acid sequence of human 

acylcoenzyme A: cholesterol 
acyltransferase I from nucleic acid 
bases 2525-3649. Amino acid sequence 
of human acylcoenzyme A: cholesterol 
acyltransferase I from amino acid 
residues 377-551. 

5B-1 - 5B-3.

The nucleic acid sequence of yeast acylcoenzyme 
A: cholesterol acyltransferase-related enzyme 
I designated Sequence ID No.: 3. The amino 
acid sequence of yeast acylcoenzyme A: 
cholesterol acyltransferase-related enzyme I 
designated Sequence ID No.: 4. 
5B-1. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme I from nucleic acid
bases 1-1289. Amino acid sequence of
acylcoenzyme A: cholesterol 
acyltransferase-related enzyme I from 
amino acid residues 1-209. 

5B-2. Nucleic acid sequence of acylcoenzyme

A: cholesterol acyltransferase- 
related enzyme I from nucleic acid 
bases 1290-2114. Amino acid sequence 
of acylcoenzyme A: cholesterol 
acyltransferase-related enzyme I from 
amino acid residues 210-484. 

5B-3. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme I from nucleic acid 
bases 2115-2601. Amino acid sequence 
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from 
amino acid residues 485-611. 

5C-1 - 5C-3.

The nucleic acid sequence of yeast acylcoenzyme 
A: cholesterol acyltransferase-related enzyme 
II designated Sequence ID No. : 5. The amino 
acid sequence of yeast acylcoenzyme A: 
cholesterol acyltransferase-related enzyme II 
designated Sequence ID No.: 6. 
5C-1. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme II from nucleic acid 
bases 1-1061. Amino acid sequence of 
acylcoenzyme A: cholesterol 
acyltransferase-related enzyme II 
from amino acid residues 1-238. 
5C-2. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-
related enzyme II from nucleic acid
bases 1062-1961. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 239-538.
5C-3. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-
related enzyme II from nucleic acid
bases 1962-2421. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 539-643.

5D. The nucleic acid sequence of mouse acylcoenzyme
A: cholesterol acyltransferase II designated
Sequence ID No.: 11. The amino acid sequence
of mouse acylcoenzyme A: cholesterol
acyltransferase II designated Sequence ID No.:
12.

Figure 6A. A restriction map of the expression vector
YepAB-ACAT2.

Figures 6B and 6C. Expression of human macrophage ACAT
in pRS426GP. 

6B. The ACAT open reading frame was inserted
at the NotI and SacI sites, downstream of
the promoter of the GAL1/10 gene
(GAL1/10p) as described in the text to
produce pRS426-ACAT. URA3 and Amp^r denote
selectable markers for yeast and E. coli
respectively. The yeast and bacterial
origins of replication (2μm and ori,
respectively) are indicated.
6C. Immunoblot of human ACAT in protein
extracts from cells transformed with
pRS426-ACAT. Double mutant cells (are1-
are2-), transformed with pRS426-ACAT
(hACAT) or with pRS426GP (vector), were
induced by growth in galactose. Proteins
were analyzed by immunoblotting.
Equivalent amounts of protein extracts
from mouse adrenal cells were loaded for
comparison. Molecular weight reference
markers (BioRad) are indicated (M). The
arrow indicates the position of the DM10
immunoreactive product in extracts from
murine adrenals. The expressed form of
hACAT in yeast is of coincident mobility.

Figures 7A and 7B. Multiple human tissue Northern
analysis of poly(A)+ RNAs probed with 32P-labeled cDNA
CI.

7A. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase II
using a wildtype acylcoenzyme A: cholesterol
acyltransferase II specific probe.
7B. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase I
using a wildtype acylcoenzyme A: cholesterol
acyltransferase I specific probe.

Figures 8A, 8B, 8C and 8D. Tissue specific expression of
ARGP1 and hACAT. 

8A and 8B. Multiple tissue Northerns (Clontech) with
indicated samples were probed with an
ARGP1 specific probe as described in the
text.
8C and 8D. The same blots were also analyzed using a
hACAT specific probe. The first panel is
identical to that published by Chang et al
(S). The second panel is the same blot as
in A and B, probed with the ACAT cDNA 1600
bp probe.

Figure 9. Fetal tissue specific expression of AGRP2.
Multiple tissue Northerns of fetal tissue (Clontech)
with indicated samples were probed with an AGRP2
specific probe as described in the text.

Figure 10. Cultured cell expression of AGRP1.
RNA samples from HepG2 and CV1 were reverse
transcribed and PCR amplified as described in the
text. P indicates a plasmid template control. The
blank lanes represent water or no RT controls.

Figure 11. Sequence comparison of human ACAT and AGRP1 
Figure 12. Sequence comparison of human ACAT and AGRP2 

Figure 13. Phylogenetic comparisons of ACAT-like
molecules.
The sequences shown were identified in genome
databases and aligned based on protein sequence
using GCG Inc software (pileup). They were
subsequently arranged according to their sequence
conservation to determine approximate evolutionary
relatedness.

Figure 14. Conserved motifs in ACAT-related gene
products.



Figures 15A and 15B. Nucleotide and predicted protein
sequence of ARGP1.

Figure 16. Nucleotide and predicted protein sequence of
ARGP2.

Detailed Description of the Invention

Throughout this application, references to specific
nucleotides are to nucleotides present on the coding
strand of the nucleic acid. The following standard
abbreviations are used throughout the specification to
indicate specific nucleotides:

C=cytosine A=adenosine
T=thymidine G=guanosine

A "gene" means a nucleic acid molecule, the sequence of
which includes all the information required for the
normal regulated production of a particular protein,
including the structural coding sequence, promoters and
enhancers.

The nucleic acids or oligonucleotides of the subject
invention also include nucleic acids or oligonucleotides
coding for polypeptide analogs, fragments or derivatives
which differ from naturally-occurring forms in terms of
the identity or location of one or more amino acid
residues (deletion analogs containing less than all of
the residues specified for the protein, substitution
analogs wherein one or more residues specified are
replaced by other residues and addition analogs wherein
one or more amino acid residues is added to a terminal or
medial portion of the polypeptides) and which share some
or all properties of naturally-occurring forms. These
nucleic acids or oligonucleotides include: the
incorporation of codons "preferred" for expression by
selected non-mammalian hosts; the provision of sites for
cleavage by restriction endonuclease enzymes; and the
provision of additional initial, terminal or intermediate
DNA sequences that facilitate construction of readily
expressed vectors.

The nucleic acids and oligonucleotides described and
claimed herein are useful for the information which they
provide concerning the amino acid sequence of the
polypeptide and as products for the large scale synthesis
of the polypeptide by a variety of recombinant
techniques. The molecule is useful for generating new
cloning and expression vectors, transformed and
transfected prokaryotic and eukaryotic host cells, and
new and useful methods for cultured growth of such host
cells capable of expression of the polypeptide and
related products.

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase II.
This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
designated Seq. I.D. No.: 7. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase II having substantially the same amino
acid sequence as the sequence designated Seq. I.D. No.:
8. Specifically, the isolated nucleic acid has the
sequence designated Seq. I.D. No.: 11. The isolated
nucleic acid encodes a mouse wildtype acylcoenzyme A:
cholesterol acyltransferase II having substantially the
same amino acid sequence as the sequence designated Seq.
I.D. No.: 12. Further, the isolated nucleic acid
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II.

This invention also provides an isolated nucleic acid
which encodes an acylcoenzyme A:
cholesterol acyltransferase III. This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
as set forth in Fig. 16. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase III having substantially the same amino
acid sequence as set forth in Fig. 16. Further, the
isolated nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III.

As used in this application, "acylcoenzyme A: cholesterol
acyltransferase III" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase III
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase II (the sequence of which is
set forth in Fig. 15). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.

As used in this application, "acylcoenzyme A: cholesterol
acyltransferase II" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase II
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase III (the sequence of which is
set forth in Fig. 16). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase II, but which
do not produce phenotypic changes.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase III, but
which do not produce phenotypic changes.

The nucleic acids of the subject invention also include
nucleic acids that encode polypeptide analogs,
fragments or derivatives which differ from naturally-
occurring forms in terms of the identity or location of
one or more amino acid residues (including deletion
analogs containing less than all of the residues
specified for the protein, substitution analogs wherein
one or more residues specified are replaced by other
residues and addition analogs wherein one or more amino
acid residues is added to a terminal or medial portion of
the polypeptides) and which share some or all properties
of the naturally-occurring forms.

The polypeptide of the subject invention also includes
analogs, fragments or derivatives which differ from
naturally-occurring forms, but which have acylcoenzyme A:
cholesterol acyltransferase activity.

This invention also provides a vector comprising an
isolated nucleic acid encoding acylcoenzyme A:
cholesterol acyltransferase II or III. The isolated
nucleic acid of the vectors is operatively linked to a
promoter of RNA transcription which may be, or is
identical to, a bacterial, yeast, insect or mammalian
promoter. The vector may be a plasmid, cosmid, yeast
artificial chromosome (YAC), bacteriophage or eukaryotic
viral DNA. Specifically, this invention provides a
vector designated YepAB-ACAT2 (Figure 6).

Further, numerous other vector backbones known in the art
as useful for expressing proteins may be employed. Such
vectors include but are not limited to: adenovirus,
simian virus 40 (SV40), cytomegalovirus (CMV), mouse
mammary tumor virus (MMTV), Moloney murine leukemia
virus, murine sarcoma virus, and Rous sarcoma virus, DNA
delivery systems, i.e. liposomes, and expression plasmid
delivery systems.

This invention also provides a vector system for the
production of a polypeptide which comprises the vector in
a suitable host. Suitable hosts include, but are not
limited to, prokaryotic or eukaryotic
cells, e.g. bacterial cells (including gram positive
cells), yeast cells, fungal cells, insect cells and
animal cells.

Suitable animal cells include, but are not limited to,
HeLa cells, Cos cells, CV1 cells and various primary
mammalian cells. Numerous mammalian cells may be used as
hosts, including, but not limited to, the mouse
fibroblast cell NIH 3T3, CHO cells, Ltk- cells, etc.
Expression plasmids such as that described supra may be
used to transfect mammalian cells by methods well known
in the art such as calcium phosphate precipitation or
electroporation.

This invention also provides a method for producing a
polypeptide (e.g. acylcoenzyme A: cholesterol
acyltransferase) which comprises growing a host vector
system under suitable conditions permitting production of
the polypeptide and recovering the polypeptide so
produced. Methods of recovering polypeptides produced in
such host vector systems are well known in the art and
typically include steps involving cell lysis,
solubilization and chromatography.

This invention also provides a method of obtaining a
polypeptide in purified form which comprises: (a)
introducing a vector, as described above, into a suitable
host cell; (b) culturing the resulting cell so as to
produce the polypeptide; (c) recovering the polypeptide
produced in step (b); and (d) purifying the polypeptide
so recovered. As discussed above, the vector may include
a plasmid, cosmid, yeast artificial chromosome,
bacteriophage or eukaryotic viral DNA. Also, the host
cell may be a bacterial cell (including gram positive
cells), yeast cell, fungal cell, insect cell or animal
cell. Suitable animal cells include, but are not
limited to, HeLa cells, Cos cells, CV1 cells and various
primary mammalian cells. Culturing methods useful for
permitting transformed or transfected host cells to
produce polypeptides are well known in the art as are the
methods for recovering polypeptides from such cells and
for purifying them.

Using the aforementioned method, this invention also
provides a purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or III and a purified mutant
acylcoenzyme A: cholesterol acyltransferase II or III.

This invention also provides an oligonucleotide of at
least 15 nucleotides capable of specifically hybridizing
with a unique sequence of nucleotides present within a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III without hybridizing
to a nucleic acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or III. Further, this
invention also provides an oligonucleotide of at least 15
nucleotides capable of specifically hybridizing with a
unique sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III without hybridizing to a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III. These
oligonucleotides may be DNA or RNA. Such oligonucleotides
may be used in accordance with well known standard methods
for known purposes, for example, to detect the presence in
a sample of DNA which will hybridize thereto.
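
By way of illustration only, the selection of candidate 15-nucleotide sequences that occur in a wildtype coding sequence but not in a mutant one may be sketched in Python. The function name unique_kmers, the toy sequences and the single-base substitution below are hypothetical and are not taken from the Sequence Listing. Exact string matching on one strand is assumed here as a crude stand-in for specific hybridization, which in practice also depends on the stringency of the hybridization and wash conditions.

```python
def unique_kmers(target, other, k=15):
    """Return (start, k-mer) pairs that occur in `target` but nowhere in `other`.

    Exact matching on a single strand is an assumption of this sketch; it
    approximates, but does not model, specific hybridization.
    """
    target = target.upper()
    other = other.upper()
    # Every k-nucleotide window of the sequence that must NOT be recognized.
    other_kmers = {other[i:i + k] for i in range(len(other) - k + 1)}
    return [
        (i, target[i:i + k])
        for i in range(len(target) - k + 1)
        if target[i:i + k] not in other_kmers
    ]


# Hypothetical toy sequences, for illustration only.
wildtype = "ATGGCTAGCTTCGGATCCAAGCTTGGTACCGAGCTCGAATTC"
mutant = wildtype[:20] + "T" + wildtype[21:]  # single G-to-T substitution at index 20

wildtype_specific = unique_kmers(wildtype, mutant)
mutant_specific = unique_kmers(mutant, wildtype)

for start, kmer in wildtype_specific:
    print(start, kmer)
```

Under these assumptions, each candidate reported for the wildtype sequence spans the substituted position, which is what distinguishes it from the mutant sequence.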

The oligonucleotides include, but are not limited to,
oligonucleotides that hybridize to mRNA encoding
acylcoenzyme A: cholesterol acyltransferase II or III so
as to prevent translation of the protein.

This invention also provides a nucleic acid having a
sequence complementary to the sequence of the isolated
nucleic acid which encodes acylcoenzyme A: cholesterol
acyltransferase II or III.
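
By way of illustration only, a sequence complementary to a given nucleic acid, such as an antisense oligonucleotide, may be derived computationally as in the following minimal sketch. The function names, the length check against the stated minimum of 15 nucleotides, and the restriction to unambiguous DNA bases are assumptions made for this example and are not part of the specification. Specificity of hybridization would still have to be established experimentally. The example input is the are2-5' primer (Sequence ID No.: 17) recited in the Experimental Details.

```python
# Illustrative sketch only: derive the complementary strand of a DNA sequence.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def meets_length_requirement(oligo: str, min_length: int = 15) -> bool:
    """Check only the stated minimum length and that all bases are A, C, G or T.

    This does NOT test hybridization specificity (assumption: that is
    determined in the laboratory, not by this helper).
    """
    return len(oligo) >= min_length and set(oligo.upper()) <= set("ACGT")


# are2-5' primer as printed in the Experimental Details.
primer = "CATTGCAGTTACACGTGAATGC"
antisense = reverse_complement(primer)
print(antisense, meets_length_requirement(antisense))
```

Applying the function twice returns the original sequence, which is a convenient sanity check on any complement routine.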






This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol which comprises (a) obtaining
from the subject an appropriate sample containing a
mixture of all of the subject's nucleic acids; and (b)
determining whether any nucleic acid in the sample from
step (a) is, or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase so as to thereby determine whether the
subject's imbalance in sterol levels is due to a defect
in esterification of sterol. The determination step (b)
may comprise: (i) contacting the sample of step (a) with
the isolated nucleic acid which encodes acylcoenzyme A:
cholesterol acyltransferase II or III or the
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III under conditions permitting
binding of any nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase to the
nucleic acid or oligonucleotide so as to form a complex;
(ii) isolating the complex so formed; and (iii)
identifying the nucleic acid in the isolated complex so
as to thereby determine whether any nucleic acid in the
sample contains a nucleic acid which is, or is derived
from, a nucleic acid which encodes a mutant acylcoenzyme
A: cholesterol acyltransferase II or III. In this
method, both the isolation of any complex formed and the
identification of the nucleic acid therein are
effected using standard methods well known in the art.

In order to facilitate identification of the nucleic acid
from step (a), the isolated nucleic acid or the
oligonucleotide is labeled with a detectable marker. The
detectable marker may be a radioactive isotope, a
fluorophore or an enzyme. In addition, the nucleic acid
sample may be bound to a solid matrix before performing
step (i).

This invention also provides a method for treating a
subject who has an imbalance in sterol levels due to a
defect in esterification of sterol which comprises
introducing an isolated nucleic acid which encodes a
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III into the subject under conditions such that the
nucleic acid expresses a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, so as to thereby
treat the subject.

This invention also provides a method for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises transforming
appropriate cells from the subject with a vector which
expresses the nucleic acid complementary to the isolated
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, and introducing
the transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. Further, in a preferred
embodiment, the nucleic acid is capable of specifically
hybridizing to an mRNA molecule encoding acylcoenzyme A:
cholesterol acyltransferase II or III so as to prevent
translation of the mRNA molecule.

This invention also provides a method for inhibiting the
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises introducing an
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III into the subject so as to
thereby inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. The oligonucleotide is
capable of specifically hybridizing to an mRNA molecule
encoding acylcoenzyme A: cholesterol acyltransferase II
or III so as to prevent translation of the mRNA molecule.

This invention also provides for a method for identifying
a chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or III in
a subject which comprises (a) contacting a wildtype
acylcoenzyme A: cholesterol acyltransferase II or III
with the chemical compound under conditions permitting
binding between the acylcoenzyme and the chemical
compound; (b) detecting specific binding of the chemical
compound to the acylcoenzyme; and (c) determining whether
the chemical compound inhibits the activity of the
coenzyme so as to identify a chemical compound which is
capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

This invention also provides a method for differentially
inhibiting one acylcoenzyme A: cholesterol
acyltransferase but not others using the above methods.
In an embodiment, only acylcoenzyme A: cholesterol
acyltransferase I is inhibited. In another embodiment,
only acylcoenzyme A: cholesterol acyltransferase II
(ARGP1) is inhibited. In another embodiment, only
acylcoenzyme A: cholesterol acyltransferase III (ARGP2)
is inhibited. Alternatively, two of the acylcoenzyme A:
cholesterol acyltransferases may be inhibited. This
invention further provides pharmaceutical compositions
which will differentially inhibit one or more
acylcoenzyme A: cholesterol acyltransferases.

This invention also provides for a pharmaceutical
composition comprising the chemical compound identified
by the above-described method in an amount effective to
inhibit acylcoenzyme A: cholesterol acyltransferase II or
III in a subject and a pharmaceutically effective
carrier.


This invention also provides a method of treating a
subject who has atherosclerosis comprising administering
to the subject the above-described pharmaceutical
composition. This invention also provides a method of
treating a subject who has hyperlipidemia comprising
administering to the subject the above-described
pharmaceutical composition.

This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or III.
The mammal includes, but is not limited to, a mouse,
bovine, cat or dog.

This invention is illustrated in the Experimental Details
section which follows. This section is set forth to
aid in an understanding of the invention but is not
intended to, and should not be construed to, limit in any
way the invention as set forth in the claims which follow
thereafter.

Experimental Details

First Series of Experiments 

Example 1: 

Materials and Methods:

Transformation of yeast was performed with lithium
acetate (15) by amino-acid prototrophy selection. A
diploid strain (5051) was constructed between two
isogenic derivatives of W303 (16): W1346-3C (MATa,
ade2-1, can1-100, his3-11,15, leu2-3,112, trp1-1,
ura3-1) and W1J34-2C (MATa, can1-100, his3-11,15,
leu2-3,112, trp1-1, ura3-1, met14ΔHpaI-SalI). Growth on
complete (YEPD) or synthetic medium, sporulation and
dissection were performed as described (17).

Competent cells of Escherichia coli strain DH5α
(Gibco-BRL) and DNA modifying enzymes (Promega) were used
according to the manufacturers' instructions. pH3(34),
from L.A. Grivell, was digested with Nhe I, blunt-ended
with Klenow, and digested with Avr II to
liberate a 1614-bp fragment. An Xba I, Sma I fragment of
pJH-Hl encoding the HIS3 gene was then inserted at these
sites in the vector backbone to produce the are1ΔNA
allele. This construct was digested with Bsa I to
liberate a 3821-bp fragment which was then transformed
into strain 5051. Disruption of ARE1 was confirmed by
Southern blot analysis.

Radioactive probes of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I were prepared by random
priming (Pharmacia) with 32P-dCTP. Genomic DNA (18) was
transferred to Hybond membranes (Amersham) and hybridized
in the absence of formamide at 65°C or 60°C (19).

A shotgun library of cosmid 14-21 from chromosome XIV
(Peter Philippsen, Biozentrum Basel) was constructed
using the nebulizing technique (20). The DNA was
nebulized (90 seconds, 2 bars), size fractionated,
treated with DNA polymerase I (Klenow fragment) and T4
DNA polymerase and blunt-end ligated into pTZ18R
(Pharmacia, Germany). Nucleotide sequencing was
performed by dideoxy chain termination with
digoxigenin-labeled reverse primer and Sequenase (United
States Biochemical). The reactions were analyzed on the
GATC 1500 direct blotting electrophoresis system (GATC
GmbH, Germany) using the Boehringer-Mannheim
Dig-development protocol. Sequences were aligned by
SeqMan (DNA Star Inc.). Database searching was performed
with BLAST (21) and GCG Inc. software (22). The DNA
sequences of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I and acylcoenzyme A:
cholesterol acyltransferase-related enzyme II genes are
deposited at GenBank (P25628 and U51790, respectively).

KO-5' and KO-3' primers
(GAGGGGACGAAAATTAGCCGCTATTAATTCTGGTATTGCCACCTAGACAAGAAG
TAAACAGACACAGATGcaagagttcgaatctcttagc (Sequence ID No.:
15) and CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT
CGActacgtcgtaaggccgtttctgac (Sequence ID No.: 16),
respectively; lower case corresponds to the LEU2 gene)
were used in a PCR with the LEU2 gene as a template to
produce the selectable yeast gene flanked by acylcoenzyme
A: cholesterol acyltransferase-related enzyme II gene
sequences (23). This was used to transform a derivative
of yeast strain 5051, heterozygous for the are1ΔNA
allele. To identify integrants at the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II locus, a
PCR was performed on genomic DNA from these strains using
are2-5' (CATTGCAGTTACACGTGAATGC) (Sequence ID No.: 17),
are2-3' (TAGCTCCACAGAACAGTTGCAGG) (Sequence ID No.: 18)
and a 3' amplimer corresponding to the LEU2 gene (L2-3',
CTCTGACAACAACGAAGTCAG) (Sequence ID No.: 19).
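
The layout of the two knock-out primers described above can be made explicit with a short sketch that separates each primer into its upper-case portion (flanking sequence of the target gene) and its lower-case portion (which primes on LEU2). The sequences are copied from the text with the line-wrapped halves joined, which is an assumption about how the printed lines fit together; the function and variable names are hypothetical and chosen only for this example.

```python
from typing import Tuple

# Primer sequences as printed in the text, with wrapped lines joined (assumption).
# Upper case: flanking sequence of the target gene; lower case: LEU2 priming part.
KO_5_PRIME = (
    "GAGGGGACGAAAATTAGCCGCTATTAATTCTGGTATTGCCACCTAGACAAGAAG"
    "TAAACAGACACAGATGcaagagttcgaatctcttagc"
)
KO_3_PRIME = (
    "CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT"
    "CGActacgtcgtaaggccgtttctgac"
)


def split_knockout_primer(primer: str) -> Tuple[str, str]:
    """Split a primer into (upper-case homology arm, lower-case priming part)."""
    for index, base in enumerate(primer):
        if base.islower():
            return primer[:index], primer[index:]
    return primer, ""  # no lower-case part present


for name, primer in (("KO-5'", KO_5_PRIME), ("KO-3'", KO_3_PRIME)):
    arm, priming = split_knockout_primer(primer)
    print(name, "homology arm:", len(arm), "nt; LEU2 priming part:", len(priming), "nt")
```

Splitting on letter case relies entirely on the typographic convention stated in the text and would not work on a sequence stored in a single case.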



Cells (1-2 units of absorbance at 600 nm) were
incubated in YEPD or defined media containing 1 µCi/ml
3H-oleate in tyloxapol/ethanol (1:1) for 16 hours. Total
lipids were prepared by hexane extraction (25) and
analysed by thin layer chromatography on DC-plastikfolien
kieselgel 60 plates (E-Merck, Germany). The plate was
developed in hexane, diethyl ether and acetic acid
(70:30:1) and stained with iodine vapor. Incorporation
of label into triglyceride and ergosterol ester was
ascertained following scintillation counting and
normalization to a 14C-cholesterol internal standard and
the dry weight of the cells.
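
The normalization step described above can be sketched as follows. This is only one plausible reading: it assumes that the 14C-cholesterol internal standard is used to correct for extraction recovery and that the corrected 3H counts are then divided by cell dry weight. The function name, parameter names and the numerical values are hypothetical and do not come from the specification.

```python
def normalized_incorporation(
    sample_3h_dpm: float,
    standard_14c_recovered_dpm: float,
    standard_14c_added_dpm: float,
    dry_weight_mg: float,
) -> float:
    """Return 3H incorporation per mg dry weight, corrected for recovery.

    Assumption: recovery is estimated as the fraction of the 14C internal
    standard that is recovered in the counted lipid fraction.
    """
    if min(standard_14c_recovered_dpm, standard_14c_added_dpm, dry_weight_mg) <= 0:
        raise ValueError("standard counts and dry weight must be positive")
    recovery = standard_14c_recovered_dpm / standard_14c_added_dpm
    return sample_3h_dpm / recovery / dry_weight_mg


# Hypothetical numbers, for illustration only.
value = normalized_incorporation(
    sample_3h_dpm=1000.0,
    standard_14c_recovered_dpm=800.0,
    standard_14c_added_dpm=1000.0,
    dry_weight_mg=5.0,
)
print(round(value, 1))
```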

To overexpress the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I gene by copy number
under the control of its own promoter in YEp3-16, a 2354
bp Cla I fragment from pH3(34), encompassing the entire
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I gene, was made blunt-ended with Klenow DNA
polymerase I and introduced into the Sma I site of
YEp352. To constitutively overexpress acylcoenzyme A:
cholesterol acyltransferase-related enzyme I from the ADH
promoter in pADH5-36, a 2290 bp Nar I fragment of
pH3(34), starting 70 bp 5' to the ORF, was blunt-ended
with Klenow and ligated to Klenow-treated, Eco RI
digested pDC-ADH (a derivative of pS5) (26). Increased
expression of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I transcripts, relative to
a wild-type cell, was confirmed by northern blot
analysis.

The incorporation of [1-14C] acetate into saponified
lipids was assessed as a measurement of sterol synthesis.
Approximately 2 OD600 units of cells were incubated with
20 µCi [1-14C] acetate in 2 ml defined media at 30°C for
3 hours and subjected to lipid saponification, hexane
extraction and TLC (29). The
incorporation of counts into total sterols was assessed
following scintillation counting. To normalize the
estimate of sterol biosynthesis to incorporation of
acetate into the fatty acid pool, the aqueous lysate
remaining after hexane extraction was acidified with
concentrated HCl and re-extracted with hexane (30).

Experimental Discussion 

To use yeast genetics to study sterol esterification, the
human acylcoenzyme A: cholesterol acyltransferase
sequence was used to search for homologous yeast genes
and subsequently to identify an additional human isoform
(Figures 1A and 1B). Acylcoenzyme A: cholesterol
acyltransferase-related enzyme I, an 1830-bp open reading
frame (ORF) on yeast chromosome III, encodes a
610-residue protein with 23% identity and 49% similarity to
human acylcoenzyme A: cholesterol acyltransferase I
(Figures 1A and 1B). The yeast and human proteins
possess leucine zipper motifs that could mediate
protein-protein interactions (esterification is probably
performed by a multimeric complex) (12), and possess at
least two predicted transmembrane domains that may
mediate the membrane association of the acylcoenzyme A:
cholesterol acyltransferase reaction (13, 14).

To define the role of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I in sterol
esterification, the deletion mutant, are1ΔNA, was
generated by homologous recombination (15, 16, 17) (Fig.
2A). In a diploid strain, a 1614-bp segment of one
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I allele was replaced with the HIS3 gene, and the
replacement was confirmed by Southern hybridization (Fig.
2B). Analysis of mutant and wild-type haploid progeny from
this diploid indicated no differences in growth rates or
incorporation of 3H-oleate into ergosterol ester.

The lack of a defect in sterol esterification in are1ΔNA
strains could result from alternate esterification
activities. Reduced stringency hybridization of yeast
genomic DNA with the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding sequence as a
probe indicated that additional homologous sequences were
present (18, 19). A Bam HI digestion of genomic DNA
produced the predicted 2.9-kb acylcoenzyme A: cholesterol
acyltransferase-related enzyme I fragment and a ~6.0-kb
hybridizing fragment (Fig. 2C). Contour clamped
homogeneous electric field electrophoretic analysis of
yeast chromosomes suggested the latter sequence was
localized to chromosome X or XIV. On the basis of
homology to acylcoenzyme A: cholesterol acyltransferase-
related enzyme I, this gene, designated acylcoenzyme A:
cholesterol acyltransferase-related enzyme II, encodes a
second yeast homolog to human acylcoenzyme A: cholesterol
acyltransferase I (Figures 1A and 1B). The genomic
sequence (20, 21, 22) encompassing acylcoenzyme A:
cholesterol acyltransferase-related enzyme II on
chromosome XIV predicts a 5997-bp Bam HI fragment and a
1929-bp ORF, which translates into a 643-residue
polypeptide. The yeast acylcoenzyme A: cholesterol
acyltransferase-related enzyme genes are 61% and 49%
identical at the DNA and predicted protein levels,
respectively. Are1p, Are2p and the human acylcoenzyme A:
cholesterol acyltransferase I protein are most related at
the COOH-terminal region (42% identity over a 90-residue
sequence) (Figures 1A and 1B).
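
As a simple consistency check on the figures quoted above, the stated ORF lengths can be divided by the codon length of three nucleotides. The sketch below does only that arithmetic; the function name is hypothetical, and whether the quoted base-pair counts include the stop codon is not stated in the text, so the check merely confirms that the numbers as printed agree with one another.

```python
CODON_LENGTH = 3


def codons_in_orf(orf_length_bp: int) -> int:
    """Return the number of whole codons in an ORF of the given length."""
    if orf_length_bp % CODON_LENGTH != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_length_bp // CODON_LENGTH


# Lengths and residue counts as quoted in the text.
quoted = {"enzyme I": (1830, 610), "enzyme II": (1929, 643)}
for name, (orf_bp, residues) in quoted.items():
    print(name, codons_in_orf(orf_bp), "codons; quoted residues:", residues)
```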

To assess the contribution of Are2p to sterol
esterification, one copy of the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II coding
sequence was deleted from the genome of an ARE1/are1ΔNA
heterozygous diploid by a polymerase chain reaction
approach (23) (Fig. 2D). Haploid progeny representing
the single are1ΔNA and are2Δ deletions and the
are1ΔNA are2Δ double mutant were obtained. To ascertain the
effect of deletion of acylcoenzyme A: cholesterol
acyltransferase-related enzyme genes upon cytoplasmic
lipid storage, the neutral lipid components (triglyceride
and sterol ester) of the yeast cells were detected by
fluorescence microscopy after staining with Nile Red
(24). In wild-type cells, cytoplasmic fluorescent
droplets accumulated in stationary phase cultures (Fig.
3A). No differences in are single mutants were detected.
However, the number of droplets observed in are1ΔNA are2Δ
double mutants was one-third of that in wild-type
strains (Fig. 3B; over multiple fields, 5.57 ± 2.73 vs.
16.73 ± 4.6 droplets/cell, P<0.05).
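
The "one-third" figure follows directly from the two mean droplet counts quoted above, as the short calculation below shows. Only the two means come from the text; the variable names are chosen for this example, and no attempt is made to reproduce the reported significance test, since the number of fields counted is not given.

```python
# Mean droplets per cell as quoted in the text.
double_mutant_mean = 5.57
wild_type_mean = 16.73

ratio = double_mutant_mean / wild_type_mean
print(f"double mutant / wild type = {ratio:.3f}")
```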

The wild-type and are mutant cells were analyzed for the
incorporation of 3H-oleate into sterol ester (25) (Fig.
4B). No significant differences in triglyceride
biosynthesis were detected. In contrast to normal sterol
ester biosynthesis observed in are1ΔNA mutants,
deficiencies in sterol esterification were apparent in
both are2Δ and are1ΔNA are2Δ mutants. These were detected
by iodine vapor staining of thin layer chromatographs of
total yeast lipids in addition to the oleate
incorporation assays. Sterol ester levels of are2Δ
single mutants were reduced to less than 26% of wild-type,
suggesting that the acylcoenzyme A: cholesterol
acyltransferase-related enzyme II isoform confers the
majority of acyltransferase activity. The are1ΔNA are2Δ
double mutant was almost totally deficient in sterol
esterification (less than 1% of wild-type levels). In
confirmation of the critical role of Are proteins in
sterol esterification, microsomes from double mutant
yeast cells lacked acylcoenzyme A: cholesterol
acyltransferase activity when assayed in vitro.

To confirm that the protein encoded by an acylcoenzyme A:
cholesterol acyltransferase-related enzyme ORF was
sufficient for sterol esterification, the acylcoenzyme A:
cholesterol acyltransferase-related enzyme I coding
sequence was over-expressed in vectors with increased
copy number (YEp3-16) or elevated transcription (the
alcohol dehydrogenase promoter in pADH5-36) (26). There
were no detectable changes in triglyceride or
phospholipid biosynthesis resulting from acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
over-expression. In are2Δ or are1ΔNA are2Δ double
mutants, acylcoenzyme A: cholesterol acyltransferase-
related enzyme I over-expression complemented the sterol
esterification defect (Fig. 4C). In wild-type and
are1ΔNA single mutants, the high level expression of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I did not elevate sterol ester synthesis above
untransformed controls. This suggests that either
substrates are limiting in acylcoenzyme A: cholesterol
acyltransferase-related enzyme strains or that the
enzyme is post-translationally regulated as in mammalian
cells (27).

An accumulation of unesterified sterol in cell membranes
would likely be deleterious (28). However, despite the
major changes in sterol esterification conferred by the
are mutants, we did not detect any reduction in growth
rates. The established role of sterol esterification in
the storage of sterol suggests that an inability to
esterify sterol could lead to homeostatic changes in
sterol biosynthesis. This relationship might account for
the viability of the mutants. Total lipids, labelled by
the incorporation of acetate into exponentially
growing cells (29, 30), were saponified and extracted.
The are1ΔNA are2Δ double mutants had a two- to three-fold
lower level of sterol biosynthesis than wild-type cells,
although no changes were observed in the single mutants
(Fig. 4D). In fact, free sterol concentrations were
roughly equivalent in all cells. Feedback regulation of
sterol biosynthesis by acylcoenzyme A: cholesterol
acyltransferase activity has been observed in mammalian
cells (31) and may be a common mechanism that maintains
intracellular sterol at non-toxic concentrations.

The involvement of multiple gene families in sterol
homeostasis is common in mammalian and yeast cells (for
example, the LDL receptor related protein and scavenger
receptor gene families, the SREBP family, and
3-hydroxy-3-methyl-glutaryl-CoA reductase) (4, 32, 33, 34).
This apparent redundancy of function has clear physiological
consequences as evidenced by deletion of any one of the
family members. The observation here of two yeast genes
for sterol esterification provoked the hypothesis of
similar redundancy for this reaction in humans. To this
end, a consensus of the yeast acylcoenzyme A: cholesterol
acyltransferase-related enzymes and human acylcoenzyme A:
cholesterol acyltransferase I sequences was used to
identify an additional cDNA with significant identity
(47%) to human acylcoenzyme A: cholesterol
acyltransferase I and the yeast proteins (Figure 1B,
GenBank accession # R07932).

Sterol homeostasis is a complex event under subtle
regulatory controls, one component of which is sterol
esterification. The demonstration here of multiple yeast
and human acylcoenzyme A: cholesterol acyltransferase
isoforms raises the possibility that in vivo, the enzymes
exhibit alternate substrate preferences. The analysis of
esterification reactions in yeast is likely to impact the
understanding of sterol homeostasis and atherosclerosis
in humans.

Example 2:

Tissue specific expression of acylcoenzyme A: cholesterol
acyltransferase II was analyzed by Northern blot
hybridization of RNA obtained from the described tissues.

Using the same materials and procedures of Chang et al.
(10), the specific expression of acylcoenzyme A:
cholesterol acyltransferase II in liver and muscle is
documented, in contrast to similar experiments using the
previously known acylcoenzyme A: cholesterol
acyltransferase I (10) (Figures 7A and 7B). Acylcoenzyme
A: cholesterol acyltransferase II was also detected and
specifically expressed in adrenal, thyroid and testicular
tissues.

Example 3:

After determining the consensus sequence between the two
yeast genes and the previously known human acylcoenzyme A:
cholesterol acyltransferase, the consensus sequence was
compared to sequences deposited in GenBank. The clones
containing the sequences that showed similarity to the
consensus sequence were ordered from the I.M.A.G.E.
Consortium, affiliated with Research Genetics, Inc., 2130
Memorial Parkway S.W., Huntsville, Alabama 35801. Clones
deposited with the I.M.A.G.E. Consortium are publicly
available upon request. A particular clone, GenBank ID
clone No. Z39933, was chosen. This clone contains a cDNA
fragment whose sequence encodes human acylcoenzyme A:
cholesterol acyltransferase II. The fragment was cut out
with restriction enzymes Bgl II and Not I. The
resulting fragment was introduced into the yeast
expression vector pRS426 at Bgl II and Not I sites
downstream of the yeast promoter (GAL1/GAL10), which is
regulated by carbon sources. The resultant vector was
designated YepAB-ACAT2 (Figure 6).

Example 4:

Antisense RNA technology can be used to create mice, or
mouse or human cell lines, incapable of translating
acylcoenzyme A: cholesterol acyltransferase II RNA into
protein. Standard methods may be used to create an
antisense oligonucleotide to the human homolog of
acylcoenzyme A: cholesterol acyltransferase II. These
methods are well known in the art (36).

Specifically, part or all of a wildtype acylcoenzyme A:
cholesterol acyltransferase II is ligated adjacent to a
mammalian promoter in the opposite orientation. The
promoter and other replicatory mechanisms inside the cell
will transcribe the antisense strand of the human homolog
of acylcoenzyme A: cholesterol acyltransferase II.
This strand will bind with the coding mRNA which is
normally synthesized to form a complex. Due to the
formation of this complex, the antisense strand prevents
the translation of the coding mRNA into protein.

Further, one skilled in the art can synthesize an
oligonucleotide in vitro which is capable of binding the
mRNA that encodes a human homolog of acylcoenzyme A:
cholesterol acyltransferase II so as to inhibit the
translation of the mRNA into protein. The
oligonucleotides can then be introduced into the subject
using a pharmaceutically acceptable carrier. Methods of
synthesizing naturally and non-naturally occurring
oligonucleotides which are capable of inhibiting the
translation of the mRNA into protein are well known in
the art. Also, means of transfecting an organism with
such oligonucleotides are well known in the field.

Example 5: 

Mice can be made with an alteration in their genome,
specifically at the acylcoenzyme A: cholesterol
acyltransferase II gene site. Standard methods may be
used to alter the genome. These methods are well known
in the art (37, 38).

One such process to achieve this goal involves disrupting
the wildtype mouse homolog of acylcoenzyme A: cholesterol
acyltransferase II in vitro, then introducing the altered
gene into mouse embryonal stem cells in such a way as to
target integration into the corresponding genomic region.
This process can be performed such that both copies of
the wildtype acylcoenzyme A: cholesterol acyltransferase
II are replaced by the altered, knock-out version. These
modified cells can be introduced into blastocysts which
will be allowed to develop into chimeric adults. Mice
bearing the altered acylcoenzyme A: cholesterol
acyltransferase II gene will be mated to each other to
generate homozygous mutant acylcoenzyme A: cholesterol
acyltransferase II animals.
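
The expected outcome of mating two mice that each carry one altered and one wildtype copy of the gene can be sketched by enumerating the possible allele combinations. The sketch assumes simple Mendelian segregation of a single locus with equal viability of all genotypes, which the specification does not state; the allele symbols "+" (wildtype) and "-" (altered) and the function name are chosen only for this illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import product
from typing import Dict


def cross(parent1: str, parent2: str) -> Dict[str, Fraction]:
    """Return expected genotype fractions for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "+-".
    Assumes each allele is transmitted with equal probability.
    """
    offspring = Counter(
        "".join(sorted(a + b)) for a, b in product(parent1, parent2)
    )
    total = sum(offspring.values())
    return {genotype: Fraction(n, total) for genotype, n in offspring.items()}


# Heterozygote x heterozygote: "--" denotes the homozygous mutant.
expected = cross("+-", "+-")
for genotype, fraction in sorted(expected.items()):
    print(genotype, fraction)
```

Under these assumptions one quarter of the progeny would be homozygous for the altered gene, so progeny would still need to be genotyped to identify them.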



Further, one can breed two mice that are heterozygous for
mutant acylcoenzyme A: cholesterol acyltransferase II.
From their progeny, one skilled in the art could select
the progeny that are homozygous for mutant acylcoenzyme A:
cholesterol acyltransferase II. Methods of breeding and
selecting such progeny are well known in the art.

Second Series of Experiments

The efficient regulation of intracellular sterol levels
is required for cell viability in all eukaryotic
organisms. When this regulation is aberrant in cells of
the arterial wall, disease states such as atherosclerosis
ensue. A critical component of this homeostasis is the
intracellular sterol esterification reaction, mediated by
the enzyme acyl coenzyme A-cholesterol acyltransferase
(ACAT). In the model eukaryote, yeast, this laboratory
has demonstrated that sterol esterification is mediated
by a two gene family (Yang et al., Science, 1996,
272:1353). The existence in human cells of two
additional genes encoding ACAT related enzymes is
demonstrated. These proteins are termed ACAT related
gene products (ARGP) 1 and 2, also known as acylcoenzyme
A: cholesterol acyltransferase II and acylcoenzyme A:
cholesterol acyltransferase III, respectively. The ARGPs
exhibit marked sequence conservation to the human ACAT
sequence (hACAT) originally identified by Chang and
colleagues. ARGP1 is expressed at high levels in
intestine and liver, in contrast to the expression of
hACAT, which is of low abundance in these tissues. The
observation that knock-out mutant mice deficient in the
murine homolog of hACAT retain sterol esterification
activity in liver and intestine (Meiner et al., PNAS,
1996, 93:14041) suggests that ARGP1 is a candidate for
sterol esterification in these tissues. The expression
of ARGP2, by contrast, seems to be restricted to the
fetal liver, suggesting that it has a role in lipid
metabolism during development. Analysis of genome
databases indicates that ACAT-like gene families are a
common occurrence in multiple organisms. It is
hypothesized that multiple enzymes for sterol
esterification will provide flexibility in response to
differing sterol and fatty acid substrates encountered by
different tissues. This further suggests specific roles
for these enzymes in lipoprotein production, lipid
homeostasis, and disease progression.

The regulation of membrane sterol levels is required for
cell viability in all eukaryotic organisms. When this
regulation is aberrant in human cells, disease states
such as atherosclerosis (excessive accumulation of
cellular esterified cholesterol in cells of the arterial
wall, reviewed in (1-4)), Niemann Pick C (inability to
store sterol correctly, resulting in lysosomal lipidosis,
(5)) or Wolman's disease (a defect in sterol ester
hydrolysis, (6)) ensue. A critical component of this
homeostasis is the intracellular neutralization of sterol
by an esterification reaction between the C3-OH group of
cholesterol and fatty acyl-coenzyme A. This reaction is
performed in mammalian cells by the enzyme acyl coenzyme
A-cholesterol acyltransferase (ACAT). Since the process
of sterol esterification converts sterol into a
cytoplasmic storage form, it is critical to all
eukaryotes, including the microorganism Saccharomyces
cerevisiae (budding yeast). Analysis of sterol
homeostasis in this model organism has the advantage that
molecular genetics, particularly since the completion of
the yeast genome sequencing project, is powerful and
relatively straightforward. Taking advantage of this, it
is demonstrated that sterol esterification in yeast is
mediated by a two gene family (7), neither member of which
is essential for life. These genes (ARE1 and ARE2,
encoding ACAT Related Enzymes 1 and 2, respectively) are
both capable of independently esterifying sterol,
although in terms of contribution to the sterol ester
mass of the cell, Are1 is a minor isoform relative to
Are2. The genes are structurally and functionally
analogous to the ACAT sequence isolated originally from
macrophages by Chang and colleagues (8). They share
approximately 23% identity at the protein level, and
expression of the human macrophage ACAT cDNA in yeast are
double deletion mutants results in esterification of
sterol (9).

A critical test of the role of the ACAT gene product in
cholesterol homeostasis and atherosclerosis was initiated
by Farese and colleagues, by the production of
"knock-outs" at the Acact locus corresponding to the mouse
homolog of hACAT (10). The fidelity of the mutation was
confirmed by sequencing of cDNA from the disrupted allele
and by the failure to detect immunoreactive protein in
Acact-/- cell extracts. The animals were healthy and
fertile and had residual, but significant, sterol
esterification activity in fibroblasts and macrophages.
Cholesterol ester levels and ACAT activity in the
adrenals were also severely reduced. Conversely, Acact-/-
livers contained significant levels of cholesterol ester,
and esterification activity was not altered.
Furthermore, sterol absorption in the intestine, a
process that probably requires esterification, was
unaffected by the gene disruption. These observations
strongly suggest that, as in yeast, there are multiple
genes for the ACAT reaction in mammalian cells, probably
with tissue specific expression patterns.

Interestingly, despite the clear origin of the yeast gene
family by gene duplication, the ARE proteins have
diverged such that the majority of sequence conservation
is in the COOH-terminal domain of the protein. This is
presumably the critical region of the molecule, since it
is also conserved in the human protein. Using this
region as a database probe, R07932 (11) was identified,
a partially sequenced cDNA entry in the database of
expressed sequence tags (dbEST); R07932 exhibits
significant similarity to the ACATs, particularly over the
COOH-terminal region. Taken together, the "founder"
sequence, the observations in yeast of a two-gene family
for sterol esterification, and the tissue-specific
expression patterns of enzyme activity in Acact-/- knock-out
mice suggest that there are multiple genes for this
reaction in all eukaryotes. Reported here are the
isolation and characterization of cDNAs from two human
loci that encode ACAT Related Gene Products (ARGP).
ARGP1 is represented multiple times in the dbEST,
including R07932, and is expressed ubiquitously with the
highest levels occurring in the liver, intestine and
adrenal gland. By contrast, sequences identical to ARGP2
in the databases are infrequent, consistent with the
observation of an essentially embryonic pattern of
expression. Analysis of genome databases indicates that
gene families that conserve these motifs are a common
occurrence in multiple organisms.

Materials and Methods

Database searching for ACAT related sequences. A
sequence corresponding to the strongest region of protein
conservation between the human macrophage ACAT and yeast
ARE sequences was used to identify protein sequences
predicted to be encoded by entries in the dbEST using the
tblastn software (NCBI). The DNA sequences thus arising
were used to detect additional clones, in any available
database, that demonstrated overlaps of nucleotide
sequence identity. Databases searched included dbEST,
the non-redundant GenBank, and the confidential database
held at The Institute for Genomic Research (TIGR).
Overlaps between these sequences were detected using the
sequence alignment programs "lineup" and "pileup" from GCG
Inc (Madison, WI). A consensus sequence was then
generated. Escherichia coli clones with the largest
inserts corresponding to these sequences (see Table 1)
were obtained from the I.M.A.G.E. consortium and
resequenced from both ends using commercial primers, T3
and T7, or internal primers derived from a consensus.
Nucleotide sequencing was performed at the Columbia
University Combined Center core facility using an Applied
Biosystems fluorescent sequencing machine.
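The overlap step described above can be illustrated with a minimal Python sketch. It is not the GCG "lineup"/"pileup" procedure, which performs real sequence alignment; it only shows the idea of joining two reads when the end of one exactly matches the start of the other. The function names, the `min_len` threshold, and the example reads are assumptions made for illustration.

```python
def longest_overlap(left: str, right: str, min_len: int = 20) -> int:
    """Return the length of the longest suffix of `left` that is
    identical to a prefix of `right` (0 if shorter than `min_len`)."""
    if min_len < 1:
        raise ValueError("min_len must be at least 1")
    max_len = min(len(left), len(right))
    # Try the longest candidate overlap first and shrink.
    for k in range(max_len, min_len - 1, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


def merge_reads(left: str, right: str, min_len: int = 20):
    """Merge two reads into one contig if they overlap, else None.
    Exact matching only: a hypothetical simplification that ignores
    sequencing errors and reverse-complement orientation."""
    k = longest_overlap(left, right, min_len)
    if k == 0:
        return None
    return left + right[k:]


if __name__ == "__main__":
    # Toy reads, not taken from the clones in Table 1.
    print(merge_reads("ACGTACGTTTGACC", "TTGACCGGA", min_len=4))
```

Exact suffix-prefix matching is the simplest possible criterion; real EST reads contain base-calling errors, so an alignment-based tool that tolerates mismatches would be needed in practice.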

Table 1: Entries of human ACAT related gene products in
the database of expressed sequence tags.

Gene: ARGP1, ARGP2

Clone ID (IMAGE): 200587, 55218, 1881180, 78614, 153836,
106260, 128921, 213176, 245265

GenBank ID: R99213, R99214, C-IMF11, Z43867, Z33993,
H45923, H45924, M79086, R48474, R48475, T35085, R10272,
R10273, N75438, H76642

Insert size (bp): 620, 1800, 1000, 300, 800, 800, 680,
540, 300

Comments: chimera



Isolation and sequencing analysis of full length cDNA
clones of ARGP1 and ARGP2. Since in no instance were any
of the database clones full length for either ARGP1 or
ARGP2, additional clones with intact 5'-ends are
described. Several strategies were chosen. Using a
consensus nucleotide sequence derived from the sequencing
of the dbEST clones, 3'-end, gene-specific primers were
designed and synthesized and used in a PCR-based rapid
amplification of cDNA ends (RACE) to derive 5'-RACE
reaction products from a human liver/spleen Marathon
library (Clontech). A similar strategy was used to derive
PCR products from a human fetal brain library generously
provided by Bento Soares (Columbia University). In some
instances, a nested PCR reaction was performed using
internal gene-specific primers and library adaptors.
Finally, primer extension cDNA products were identified
from mRNA extracted from human intestine (a kind gift of
P. Dawson). Amplification products of the predicted size
were confirmed as gene specific using Southern
hybridization to sequences predicted to be at the 3'-end
of these products. The products were isolated from
agarose gels using Geneclean and subcloned into TA
variants of pBluescript (Stratagene) vectors or
Klenow/kinase treated and blunt-end ligated to pGEM2
(Promega). Positive clones were identified by colony
hybridization or by PCR amplification using an internal
ARGP-specific primer. Clones with the largest inserts
were sequenced to obtain novel sequence and, where
necessary, this process was reiterated with ARGP
5'-specific primers derived from the new sequence.

Tissue specific expression of hACAT and ARGPs. Fragments
of the dbEST clones R99213 and R10273, corresponding to
ARGP1 and ARGP2 respectively, were derived by digestion
with EcoRI and NotI, and purified from agarose gels with
Geneclean. A 1.6 kbp fragment corresponding to the human
ACAT cDNA identified by Chang et al was used as a probe
for the expression of this gene. Radiolabelled probes
were generated by random priming (Pharmacia) in the
presence of 32P-dCTP and used to probe Multiple Tissue
Northerns (MTN, Clontech) of human samples.
Hybridizations were performed, according to the
manufacturer's instructions, using ExpressHyb rapid
hybridization solution for 1 hour at 78°C, followed by
washes in 2xSSC at 55°C and 0.1xSSC, 0.5% SDS at 50°C.

Cell culture expression of ARGPs. To facilitate
quantitation of mRNA from the ARGP genes, a
reverse-transcriptase PCR (RT-PCR) approach was devised to
analyze expression in a variety of human (HepG2, THP-1
macrophages), rodent (J774 macrophages) and simian
(CV1 kidney cells) cell lines. Where possible, primers were
designed to be conserved between rodents and humans (as
described below, the mouse sequence homolog to ARGP1 has
been identified). Alternatively, PCR conditions were
optimized to permit moderate mismatches. The ARGP
amplification primers were designed to be gene specific
(i.e. to regions not conserved within the family) and to
produce distinct size products.
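As a rough illustration of the two primer design criteria mentioned above (tolerance of a few mismatches and distinct product sizes), the following Python sketch locates primer binding sites on a template while allowing a limited number of mismatches, then reports the predicted product length. The function names, the mismatch threshold, and the toy template are hypothetical; none of the actual ARGP primers are given in this section, and real primer design would also weigh melting temperature and 3'-end stability.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA string (uppercase A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_mismatches(primer: str, site: str) -> int:
    """Number of positions at which primer and site differ."""
    return sum(a != b for a, b in zip(primer, site))


def find_sites(template: str, primer: str, max_mismatches: int = 0):
    """Start indices where `primer` matches `template` with at most
    `max_mismatches` substitutions (no gaps)."""
    n = len(primer)
    return [
        i
        for i in range(len(template) - n + 1)
        if count_mismatches(primer, template[i:i + n]) <= max_mismatches
    ]


def product_size(template, forward, reverse, max_mismatches=0):
    """Smallest predicted amplicon length on the top strand, or None
    if either primer has no acceptable binding site."""
    fwd_sites = find_sites(template, forward, max_mismatches)
    # The reverse primer anneals to the bottom strand, so search for
    # its reverse complement on the top strand.
    rev_sites = find_sites(template, revcomp(reverse), max_mismatches)
    sizes = [r + len(reverse) - f
             for f in fwd_sites for r in rev_sites if r >= f]
    return min(sizes) if sizes else None


if __name__ == "__main__":
    # Toy 31-base template, not an ARGP sequence.
    template = "GGATCCAAAAACCCCCTTTTTGGGGGAATTC"
    print(product_size(template, "GGATCC", "GAATTC"))
    # One mismatch in the forward primer is rejected at 0, accepted at 1.
    print(product_size(template, "GGATCA", "GAATTC", max_mismatches=0))
    print(product_size(template, "GGATCA", "GAATTC", max_mismatches=1))
```

Running the same check for each gene-specific primer pair against each family member would show whether a pair is predicted to amplify only its intended target and whether the product sizes differ enough to be resolved on a gel.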

Experimental Results and Discussion 

The approach taken was that the region of strongest
conservation between the yeast ARE proteins and hACAT
would be critical to the function of any sterol
esterification enzymes. A region of conservation
(consensus;

LN E FGDR-FY GDWWN, single letter amino-acid code)

that is invariant over the three proteins was chosen and
a series of entries derived from gene sequencing projects
identified. In addition to sequences from
Caenorhabditis elegans, Schizosaccharomyces pombe,
Drosophila melanogaster and Arabidopsis thaliana, several
entries in the dbEST of human cDNAs that suggested an
independent gene encoding an ACAT-like protein were
observed. Using the nucleotide sequence of this clone,
a second homologous but distinct entry was identified.
These proteins are termed ACAT Related Gene Products
(ARGP) 1 (acylcoenzyme A: cholesterol acyltransferase II)
and 2 (acylcoenzyme A: cholesterol acyltransferase III).
The sequence identified by Chang et al (8) will be
referred to as hACAT hereon. A limited protein sequence
of a founder clone (R07932) for ARGP1 has been presented
previously (11). The entries in the dbEST that define
these two genes, including their insert sizes, are
described in Table 1. As is evident, the majority of
inserts (with the exception of a chimeric clone, Z43867)
are less than 1 kbp. The northern and sequence analysis
presented indicated them to be incomplete clones.
However, they clearly define two distinct genes of strong
similarity to the ACAT sequence, with the majority of
predicted protein conservation at the COOH-terminal
region. As described below, certain motifs considered
critical to sterol esterification are conserved. To
identify the role of these genes in the reaction, full
length ARGP clones were sought and their patterns of
expression identified.

ARGP1, a ubiquitously expressed member of the ACAT gene
family. To establish the profile of expression of ARGP1,
multiple tissue northerns of human mRNA were probed,
using a fragment close to the 3' end of the gene.
Although this region displays the maximum conservation at
the protein level in this gene family, the genes are
sufficiently divergent at the DNA level to be able to
design gene-specific hybridization probes. The ARGP1
sequence is expressed at abundant levels in many tissues
with the exception of lung and kidney. The majority of
tissues express a 2.0 kb message but some tissues (e.g.
adrenal, small intestine, thymus) also express a 2.4 kb
mRNA at varying levels. The same northerns were
hybridized with a probe to the human macrophage ACAT
sequence. As described by others (8, 12, 13), the hACAT
sequence detects 4 messages of approximately 3.0, 4.0,
4.7 and 7.4 kb. Upon comparison of the two hybridization
results, an overlapping but occasionally differential
expression pattern was observed. Adrenal tissues express
the highest levels of both hACAT and ARGP1 message. By
this analysis, hACAT messages are rare in liver and
intestine, in contrast to ARGP1, which is highly expressed
in these tissues. Conversely, ARGP1 was poorly expressed
in kidney, lung and placenta although hACAT mRNA was
easily detected. This tissue-specific expression
suggests that ARGP1 is an ideal candidate for sterol
esterification in tissues such as liver and intestine,
which retain sterol esterification activity in ACAT k/o
mice (10).



ARGP2, an embryonic isoform of the ACAT gene family.
Efforts to identify a transcript from ARGP2 in adult
tissues were unsuccessful. Therefore embryonic tissue
samples were chosen for investigation, since the original
founder clone was derived from a fetal liver library. A
multiple tissue northern of mRNA from human embryonic
brain, liver, kidney, and lung was probed with an
ARGP2-specific, COOH-terminal probe. As shown in figure
9, a single message of ~2.2 kb was identified only in
embryonic liver tissues, suggesting a high degree of
tissue and developmental specificity to the expression of
this gene product.

Expression of ARGP1 in cell culture models. To develop
a system in which to test the effect of reaction
substrates on the esterification reaction performed by
the ARGP enzymes, the expression of these genes in
several tissue-specific cell culture models was
examined. As shown in figure 10, ARGP1 is clearly
expressed in liver (HepG2) and kidney (CV-1) cell lines.
The latter result is somewhat in contrast to the northern
blot on human tissue samples. This most likely reflects
the sensitivity of the RT-PCR approach compared to filter
hybridization and suggests that ARGP1 is probably
expressed in most tissues. Alternatively, it may
represent a species difference (simian vs. human) or, more
interestingly, the differentiation status of the cells
under study. In data not shown here, ARGP1 was also
clearly expressed in human and mouse macrophage models
(THP-1 and J774 cells).



Sequence characteristics of ARGP1 and ARGP2. By a
combination of 5'-RACE and primer extension, additional
sequence to cDNAs for ARGP1 and ARGP2 (Figs. 11 and 12)
has been identified. The ARGP1 sequence predicts a 407
amino-acid protein with approximately 27% identity and
52% similarity to the hACAT protein (Fig. 4).
Interestingly, as was observed for the yeast ARE
proteins, the strongest conservation exists at the
COOH-terminus of the molecules, to the extent that the
NH2-terminal 50% of all these proteins is essentially
unrelated sequence. This pattern also persists at the
DNA level (not shown). Identification of the genomic
sequence to these cDNAs will establish whether this
remarkable divergence arises by exon shuffling of common
sequences. Alternatively, convergent evolution of
domains with conserved functions in sterol esterification
or related processes may have resulted in the generation
of these families. Since the level of DNA conservation
between ARGP1 and hACAT is quite low ([...]% identity), the
latter possibility seems likely. The conserved regions
are discussed in the context of multiple ACAT-like
sequences below. The ARGP1 sequence predicts a protein
of approximately 47 kDa with multiple transmembrane
domains in similar positions to those predicted in hACAT.
This strongly suggests a membrane location for ARGP1, as
would be predicted for a sterol esterification enzyme.

ARGP2 displays a significantly higher level of amino acid
conservation with hACAT than does ARGP1. Over the
sequence shown (Fig. 12), the protein is 59% identical
and 79% similar to human ACAT. Over the same region
ARGP1 is only conserved at the level of 32% identity.
This striking identity is maintained at the DNA level
(62% identity) and may suggest that ARGP2 is more closely
analogous to hACAT, in both its mechanism of action and
its origin, than is ARGP1. As for ARGP1, certain
hallmark sequences are retained in ARGP2 (see below).
The ARGP2 predicted protein also possesses several
predicted transmembrane domains. One entry to the dbEST
for ARGP2 has also been allocated an STS (sequence tagged
site) at the Whitehead Institute (entry WI-H660) and
has thus been mapped to human chromosome 12.
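The identity and similarity percentages quoted above are computed over aligned sequences. The Python sketch below shows one simple way such figures can be derived from a pairwise alignment; it is an illustration only and not the procedure used to obtain the values in the text. The residue groups used to define "similar" are an assumption (published figures depend on the scoring matrix chosen), and the two 12-residue fragments are taken from the conserved region of SEQ ID NO:2 and from a translation of the corresponding region of SEQ ID NO:3.

```python
# Illustrative conservative-substitution groups (an assumption; real
# similarity scores depend on the substitution matrix that is used).
SIMILAR_GROUPS = ("AG", "ILVM", "FYW", "KRH", "DE", "STNQ")


def _similar(a: str, b: str) -> bool:
    """True if the residues are identical or share a group."""
    return a == b or any(a in g and b in g for g in SIMILAR_GROUPS)


def identity_and_similarity(seq_a: str, seq_b: str):
    """Percent identity and percent similarity for two aligned
    sequences of equal length. Columns containing a gap ('-') in
    either sequence are ignored, which is one of several conventions."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(seq_a, seq_b)
               if a != "-" and b != "-"]
    if not columns:
        raise ValueError("no ungapped columns to compare")
    identical = sum(a == b for a, b in columns)
    similar = sum(_similar(a, b) for a, b in columns)
    n = len(columns)
    return 100.0 * identical / n, 100.0 * similar / n


if __name__ == "__main__":
    ident, simil = identity_and_similarity("FGDRMFYKDWWN", "FADRYFYGDWWN")
    print(f"{ident:.1f}% identical, {simil:.1f}% similar")
```

Because this short fragment lies in the most conserved region, it scores far higher than the whole-protein figures; that contrast is the point made in the text about conservation being concentrated at the COOH-terminus.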

Sterol esterification enzymes evolve as gene families in
multiple organisms. Using the hACAT and ARGP nucleotide
sequences as probes of multiple databases, we sought to
establish whether the observation of gene families of
ACAT related enzymes in yeast and humans was a common
occurrence in other organisms. In general this is the
case (Fig. 13). Sequences from the genomes of C. elegans,
D. melanogaster and S. pombe have been identified that
are distinct from each other, within an organism, and
exhibit approximately 25% identity at the predicted
protein level. As for all the ACAT-like proteins, the
maximum conservation is observed at the COOH-terminal
region, with many of the apparently critical motifs
described below being maintained. As would be
anticipated, the mouse cDNA for ARGP1 exhibits
approximately 85% identity with its human homolog.

Sequence conservation between ARGPs and ACAT in multiple
organisms. As described above, these sequences are
ubiquitous. This conservation, across and within
organisms, facilitates the identification of critical
domains of esterification enzymes (Fig. 14).
Interestingly, there is no sequence similarity between
any ACAT-like molecule and lecithin cholesterol
acyltransferase (LCAT), despite the shared utilization of
cholesterol. For the hACAT sequence and its murine
homologs, a similarity to "signature" motifs of enzymes
involved in acyl adenylation reactions was reported (8,
12). However, these sequences are unlikely to be
critical, since they are not conserved in any homolog
from any other organism. By contrast, there are regions
of strong conservation between these molecules which may
be critical to function. In the esterification
defective, SRD4 mutant CHO cell line, the expressed but
defective ACAT allele encodes a single amino-acid
substitution of leucine 265, which lies in a conserved
domain of human, rodent and yeast ACAT. Interestingly,
this motif in ARGP1 is more degenerate: although the
serine is conserved, the flanking sequence is
conservatively replaced by similar residues. The ACAT
reaction is probably mediated by a multimeric complex, as
shown by radiation inactivation experiments (15).
Accordingly, the yeast and human sequences all possess
"leucine zipper" multimerization motifs. ARGP1 and ARGP2
lack a classical multimerization motif. Although protein
phosphorylation as a mode of ACAT regulation has been
refuted (16), a very strong region of conservation
(consensus over 7 sequences;
LN E - - - FGDR - FYGDWWN , single letter amino-acid
code) predicts a tyrosine kinase consensus motif for
phosphorylation. ARGP2 and ARGP1 are no exception to
this. In particular, the aspartic acid-tryptophan-
tryptophan-asparagine (DWWN) sequence appears to be
invariant (with the exception of S. pombe, where it is
AWWN) and may represent an active site for the
esterification reaction. These regions of conservation
are targets for mutagenesis and in preliminary
experiments appear critical to the activity of the ACAT
and ARE enzymes (17).
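The DWWN/AWWN motif described above can be located programmatically once a cDNA is translated. The Python sketch below is a minimal illustration under stated assumptions: it uses the standard genetic code, translates in a single supplied reading frame, and searches with the pattern `[DA]WWN`, which simply encodes the two variants named in the text. The helper names are hypothetical, and the two test fragments are short in-frame stretches copied from SEQ ID NO:3 and SEQ ID NO:1 of the sequence listing.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

# Motif from the text: DWWN, with AWWN as the S. pombe variant.
MOTIF = re.compile(r"[DA]WWN")


def translate(dna: str) -> str:
    """Translate an in-frame DNA string (A/C/G/T) to one-letter
    amino acids; an incomplete trailing codon is ignored."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


def find_motif(protein: str):
    """Return (position, matched text) for each motif occurrence."""
    return [(m.start(), m.group()) for m in MOTIF.finditer(protein)]


if __name__ == "__main__":
    fragment = "GCGGACAGATATTTCTACGGCGACTGGTGGAAT"  # from SEQ ID NO:3
    protein = translate(fragment)
    print(protein, find_motif(protein))
```

A full scan of an uncharacterized EST would need to try all six reading frames, since the frame and strand are not known in advance; the sketch assumes the frame is already established.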

Why ACAT gene families? The role, if any, of these ACAT
sequence homologs in sterol homeostasis is unclear.
Since mouse macrophage ACAT is not critical to sterol
esterification in the liver and intestine, it is possible
that the additional enzymes evolved to recognize
alternate substrates and thus promote sterol absorption
in the intestine or production of lipoproteins by the
liver (39). Future experiments will be directed to
complete the molecular characterization of these genes
and test these hypotheses.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Stephen L. Sturley

(ii) TITLE OF INVENTION: DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE II AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 19

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE:
(B) STREET: 1185 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Not Yet Known
(B) FILING DATE: Herewith
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: John P. White
(B) REGISTRATION NUMBER: 28,678
(C) REFERENCE/DOCKET NUMBER: 0575/50852-A

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 278-0400
(B) TELEFAX: (212) 391-0525
(C) TELEX:

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3649 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GGGTAGAGAC GGGGTTTCAC CGTGTTAGCC AGGATGGTCT GGATCTCCTG ACCTCGTGAT 60
CCACCCACCT CGGCCTCCTA AAGTGCTGGG ATTACAGACA TGAGCCACCG CGCCCAGCCC 120
TATTCATCCC TTTTCAAAAG TCAGACCCTA GGAAGCTGGA GGGAGGTGGG GCATGGTTTT 180
ACAGTGAATT TCTGATTTCA CTCAGGGTGA TAAATCAGAC TCTTGGGGAA GCGGGTGGTG 240
GCTCTGGACA GCAGCAGGAA TGGGGATCCA GTTAGCAACA AATCCATGGA CCTATGACAG 300
GCTGAAAGCC ACCCCTTCTC CATCTTTGGG AGGTTGCCAA TGTCTGATTT AACACTATCC 360
AATGAATGAT CATTGAAAGT AAAAAATAAC TATCAACTAG CAGAAAATAT AAATGGTAAG 420
CATTAGCACA TATTTCACAT GTTTATATTT GGCTCTCAGA TTGACCTATA AAACAAAGTC 480
TGGGAAATTC TATATGATCC TGAAAAAATG ATACGCTGGT CTGGATGGTA GAATAAGTTG 540
GAGAAATGTT TAAGCCAAAA TGCAGTCTTA CCAATGACTT TTTATTTTAT TTTATTAATT 600
TTCAGGATTT TTGGTATACA GGTGGTTTTT GGTTACATGG AAAAGTTCTT TACTGGTGAT 660
TTCTGAGATT TTAGTTCACC CCTTATCCTG AGCAGTGTAC ACTGTTCCCA ATATGTAGCC 720
TTTTATCCCT CACCCCCTCT AAGTTCAAGA AGACTATGGT CCTGCAGAAA GCTTTATATG 780
TAATTAACAT ATCTTTATCT TTATCTTTAT AGGCAGTAGA CTCATCTTTT GAAACAGATT 840
CCATTAAGAG TGAATGTGTA CCCTCCCTCT AGCCTTTATT ATTACTGTTT TTGCTATTAC 900
ATGTGTTAGT GTATGTGAAT TTAATGCTTA AAAATGTATC CCATTGGCTA CTATGGCAAA 960
AGGTTGACTC ATAAGAGTTT AGCACGGGTT AAGATCTGAA AGTTTTCTCC CAGCCTCTTA 1020
TCACTGGCGC AGACTTCACA ATTCATGGAA GCCACCAGTG AGATGACATT GCCTCAGGCA 1080
GTTACTATTT TTATATTCTA TAACTCGAGG AGCTCAGGGT TTCGGAAATC ATTAAACTTT 1140
TTTTGTCCTT TTAAAGTTGG AGACAGCAAT TGTAGACAGC CTTCCAGTGG GTTATCTTTT 1200
TGTGTCTCCT TACCTGTGGA GAAGCCTATT AGCTGGGATA TGTAGTTAAA TAGCTATATT 1260
TATATATATC CAGGGCACCC CGAATTCGGG AGAGCTTCCC GGAGTCGACC TTCCTGCTGG 1320
CTGCTCTGTG ACCGCTTCCC GGCTCTGCCC TCTTGGCCGA AGTGCCCGCT GCCGGGCGCG 1380
GGCCTCAGAC AATACAATGG TGGGTGAAGA GAAGATGTCT CTAAGAAACC GGCTGTCAAA 1440
GTCCAGGGAA AATCCTGAGG AAGATGAAGA CCAGAGAAAC CCTGCAAAGG AGTCCCTAGA 1500
GACACCTAGT AATGGTCGAA TTGACATAAA ACAGTTGATA GCAAAGAAGA TAAAGTTGAC 1560
AGCAGAGGCA GAGGAATTGA AGCCATTTTT TATGAAGGAA GTTGGCAGTC ACTTTGATGA 1620
TTTTGTGACC AATCTCATTG AAAAGTCAGC ATCATTAGAT AATGGTGGGT GCGCTCTCAC 1680
AACCTTTTCT GTTCTTGAAG GAGAGAAAAA CAACCATAGA GCGAAGGATT TGAGAGCACC 1740
TCCAGAACAA GGAAAGATTT TTATTGCAAG GCGCTCTCTC TTAGATGAAC TGCTTGAAGT 1800
GGACCACATC AGAACAATAT ATCACATGTT TATTGCCCTC CTCATTCTCT TTATCCTCAG 1860
CACACTTGTA GTAGATTACA TTGATGAAGG AAGGCTGGTG CTTGAGTTCA GCCTCCTGTC 1920
TTATGCTTTT GGCAAATTTC CTACCGTTGT TTGGACCTGG TGGATCATGT TCCTGTCTAC 1980
ATTTTCAGTT CCCTATTTTC TGTTTCAACA TTGGCGCACT GGCTATAGCA AGAGTTCTCA 2040
TCCGCTGATC CGTTCTCTCT TCCATGGCTT TCTTTTCATG ATCTTCCAGA TTGGAGTTCT 2100
AGGTTTTGGA CCAACATATG TTGTGTTAGC ATATACACTG CCACCAGCTT CCCGGTTr'ZlT 2160
CATTATATTC GAGCAGATTC GTTTTGTAAT GAAGGCCCAC TCATTTGTCA GAGAGAACGT 2220
GCCTCGGGTA CT AAATT f Xk. CTAAGGAGAA ATCAAGCACT GTTCCAATAC CTACAGTCAA 2280
CCAC7TATTTG 1 c/"iV- 1 i l,i 1H1 TTGCTCCTAC CCTTATCTAC CGTGACAGCT ATCCCAGGAA 2340
1 ULiuHL 1 bin ATGTCGCTAT GAAGTTTGCA CAGGTCTTTG GTTGCTTTTT 2400
CTATGTGTAC TACATCTTTG AAAGGCTTTG TGCCCCCTTG TTTCGGAATA TCAAACAGGA 2460
GCTCGTGTTC TGGTCCTATG TGGTATTTAA CTCCATCTTG CCAGGTGTGC 2520
CCTTACTTTT TTTGCCTTTT TGCACTGCTG GCTCAATGCC TTTGCTGAGA 2580
1 Lr 1 I ALGCll TGGTGACAGG ATGTTCTATA AGGATTGGTG GAACTCCACG TCATACTCCA 2640
ALI /VI I ATAG AACCTGGAAT GTGGTGGTCtf/TVTGACTGGfcT ATATTACTAT GCTTACAAGG 2700
ACTTTCTCTG GTTTTTCTCC AAGAGATTCA AATCTGCTGC CATGTTAGCT GTCTTTGCTG 2760
TATCTGCTGT AGTACACGAA TATGCCTTGG CTGTTTGCTT GAGCTTTTTC TATCCCGTGC 2820
TGTTCGTGCT CTTCATGTTC TTTGGAATGG CTTTCAACTT CATTGTCAAT GATAGTCGGA 2880
TTGGAATGTT CTGATGTGGA CTTCTCTTTT CTTGGGCAAT GGAGTCTTAC 2940
1 v~I tjC ill 1A TTCTCAAGAA TGGTATGCAC GTCGGCACTG TCCTCTGAAA AATCCCACAT 3000
1111 LHjtMl 1A TGTCCGGCCA CGTTCCTGGA CTTGTCGTTA CGTGTTTTAG AAGCTTGGAC 3060
TTT fJTTT CCT l J Iwl 1 J A CCTTGTCACT GAAGATTGGG TAGCTCCCTG ATTTGGAGCC AGCTGTTTCC 3120
GAAGTTATCT GTGTTATTTG GACCACTCCA GGCTTTACAG ATGACTCACT 3180
GTCACTTGAA GCCAAACTGT TGGAAGTTCA CTGGAGTCTT GTACACTTAA 3240
GCAGAGCAGA ACTTTTTTTG TGGGGCTGGG TGGGGGGAGA AGACCGACTA ACAGCTGAAG 3300
TAATGACAGA TTGTTGCTGG GTCATATCAG CTTTATCCCT TGGTAATTAT ATCTGTTTTG 3360
TTTCTTGACT CTGTCCAATC AGAGAATAAA CATCATAGTT TCTTGGCCAC TGAATTAGCC 3420
AAAACACTTA GGAAGAAATC ACTTAAATAC CTCTGGCTTA GAAATTTTTT CATGCACACT 3480
GTTGGAATGT ATGCTAATTG AACATGCAAT TGGGGAAGAA AAAATGTAGA ATGATTTTTG 3540
CTATTTCTAG TAGAAAGAAA ATGTCTGTTT TCCAAAGATA ATGTTATACA TCCTATTTTG 3600
T AATT *i YTTT GAAAAAAGTT CAATGTTCAG TTTTCCTTAGT TTTTACCTT 3660


(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 550 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Val Gly Glu Glu Lys Met Ser Leu Arg Asn Arg Leu Ser Lys Ser
1 5 10 15

Arg Glu Asn Pro Glu Glu Asp Glu Asp Gln Arg Asn Pro Ala Lys Glu
20 25 30

Ser Leu Glu Thr Pro Ser Asn Gly Arg Ile Asp Ile Lys Gln Leu Ile
35 40 45

Ala Lys Lys Ile Lys Leu Thr Ala Glu Ala Glu Glu Leu Lys Pro Phe
50 55 60

Phe Met Lys Glu Val Gly Ser His Phe Asp Asp Phe Val Thr Asn Leu
65 70 75 80

Ile Glu Lys Ser Ala Ser Leu Asp Asn Gly Gly Cys Ala Leu Thr Thr
85 90 95

Phe Ser Val Leu Glu Gly Glu Lys Asn Asn His Arg Ala Lys Asp Leu
100 105 110

Arg Ala Pro Pro Glu Gln Gly Lys Ile Phe Ile Ala Arg Arg Ser Leu
115 120 125

Leu Asp Glu Leu Leu Glu Val Asp His Ile Arg Thr Ile Tyr His Met
130 135 140

Phe Ile Ala Leu Leu Ile Leu Phe Ile Leu Ser Thr Leu Val Val Asp
145 150 155 160

Tyr Ile Asp Glu Gly Arg Leu Val Leu Glu Phe Ser Leu Leu Ser Tyr
165 170 175

Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp Trp Ile Met Phe
180 185 190

Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gln His Trp Arg Thr
195 200 205

Gly Tyr Ser Lys Ser Ser His Pro Leu Ile Arg Ser Leu Phe His Gly
210 215 220

Phe Leu Phe Met Ile Phe Gln Ile Gly Val Leu Gly Phe Gly Pro Thr
225 230 235 240

Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser Arg Phe Ile Ile
245 250 255

Ile Phe Glu Gln Ile Arg Phe Val Met Lys Ala His Ser Phe Val Arg
260 265 270

Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu Lys Ser Ser Thr
275 280 285

Val Pro Ile Pro Thr Val Asn Gln Tyr Leu Tyr Phe Leu Phe Ala Pro
290 295 300

Thr Leu Ile Tyr Arg Asp Ser Tyr Pro Arg Asn Pro Thr Val Arg Trp
305 310 315 320

Gly Tyr Val Ala Met Lys Phe Ala Gln Val Phe Gly Cys Phe Phe Tyr
325 330 335

Val Tyr Tyr Ile Phe Glu Arg Leu Cys Ala Pro Leu Phe Arg Asn Ile
340 345 350

Lys Gln Glu Pro Phe Ser Ala Arg Val Leu Val Leu Cys Val Phe Asn
355 360 365

Ser Ile Leu Pro Gly Val Leu Ile Leu Phe Leu Thr Phe Phe Ala Phe
370 375 380

Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu Arg Phe Gly Asp
385 390 395 400

Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser Tyr Ser Asn Tyr
405 410 415

Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu Tyr Tyr Tyr Ala
420 425 430

Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe Lys Ser Ala Ala
435 440 445

Met Leu Ala Val Phe Ala Val Ser Ala Val Val His Glu Tyr Ala Leu
450 455 460

Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe Val Leu Phe Met
465 470 475 480

Phe Phe Gly Met Ala Phe Asn Phe Ile Val Asn Asp Ser Arg Lys Lys
485 490 495

Pro Ile Trp Asn Val Leu Met Trp Thr Ser Leu Phe Leu Gly Asn Gly
500 505 510

Val Leu Leu Cys Phe Tyr Ser Gln Glu Trp Tyr Ala Arg Arg His Cys
515 520 525

Pro Leu Lys Asn Pro Thr Phe Leu Asp Tyr Val Arg Pro Arg Ser Trp
530 535 540

Thr Cys Arg Tyr Val Phe
545 550

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2601 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GCCCTCCAGC TCTCTACTAA GACCGGTCGC AAGCATGCTG GGCGATATAT CCAAACCACA 60
CCACACATGG TCTCCCTCCT GCGTCAAAAT CTCCCCAGAC AGTCCGGACC CGCACCCGAT 120
ATCCAGAATG AAACTGCACG GCTGCAGATT CAAAAGCTCC AACGCCCTCA GCGTCATCTT 180
CGCCTGGATA TGCTGCACTC TGGTCGAACC CGTGTACTTG TGTGCTTCGC TATCATTATA 240
GAAAATCTCC GGTGGTGCCA ACTCCTCAGG ACGTGACATT ATTTCTTCTC TGATATATTT 300
CCTGTGTTTC CGTACCGCAC CTTTTTAGCA CTAcrmrr ACTATGCTCT TCTTCTTCTG 360
CTTCTTCTGC TTTTTTCCTC TTTATCACAC TATGTATGTG CTGCTCATCT CTTCTTTTTA 420
TCGATAAAAT TGAAAAATGT GAGATGGTGT AGAGTGAAAA AAAAAAAAAA ATCTGGCTTG 480
GCCATCAAAT ACCCGGCCGT GGTTGGACTC GTTTAGCGAA CAATAGCACC CAGCAGACCC 540
TGGCAACATG CGGATGATAT AAGAAGGACG AGCGTGGTGG AGGAAAGGGG CGCCATTGGC 600
ACACTCACGC AGGTGGTTGT TCAGCACOGC TTGCAGCAAG AGCGCCAAAA CAGATTGCAA 660
GAATGACGGA GACTAAGGAT TTGTTGCAAG ACGAAGAGTT TCTTAAGATC CGCAGACTCA 720
ATTCCGCAGA AGCCAACAAA CGGCATTCGG TCACGTACGA TAACGTGATC CTGCCACAGG 780
AGTCCATGGA GGTTTCGCCA CGGTCGTCTA CCACGTCGCT GGTGGAGCCA GTGGAGTCGA 840
CTGAAGGAGT GGAGTCGACT GAGGCGGAAC GTGTGGCAGG GAAGCAGGAG CAGGAGGAGG 900
AGTACCCTGT GGACGCCCAC ATGCAAAAGT ACCTTTCACA CCTGAAGAGC AAGTCTCGGT 960
CGAGGTTCCA CCGAAAGGAT GCTAGCAAGT ATGTGTCGTT TTTTGGGGAC GTGAGTTTTG 1020
ATCCTCGCCC CACGCTCCTG GACAGCGCCA TCAACGTGCC CTTCCAGACG ACTTTCAAAG 1080
GTCCGGTGCT GGAGAAACAG CTCAAAAATT TACAGTTGAC AAAGACCAAG ACCAAGGCCA 1140
CGGTGAAGAC TACGGTGAAG ACTACGGAGA AAACGGACAA GGCAGATGCC CCCCCAGGAG 1200
AAAAACTGGA GTCGAACTTT TCAGGGATCT ACGTGTTCGC ATGGATGTTC TTGGGCTGGA 1260
TAGCCATCAG GTGCTGCACA GATTACTATG CGTCGTACGG CAGTGCATGG AATAAGCTGG 1320
AAATCGTGCA GTACATGACA ACGGACTTGT TCACGATCGC AATGTTGGAC TTGGCAATGT 1380
TCCTGTGCAC TTTCTTCGTG GTTTTCGTGC ACTGGCTGGT GAAAAAGCGG ATCATCAACT 1440
GGAAGTGGAC TGGGTTCGTT GCAGTGAGCA TCTTCGAGTT GGCTTTCATC CCCGTGACGT 1500
TCCCCATTTA CGTCTACTAC TTTGATTTCA ACTGGGTCAC GAGAATCTTC CTGTTCCTGC 1560
ACTCCGTGGT GTTTGTTATG AAGAGCCACT CGTTTGCCTT TTACAACGGG TATCTTTGGG 1620
ACATAAAGCA GGAACTCGAG TACTCTTCCA AACAGTTGCA AAAATACAAG GAATCTTTGT 1680
CCCCAGAGAC CCGCGAGATT CTGCAAAAAA GTTGCGACTT TTGCCTTTTC GAATTGAACT 1740
ACCAGACCAA GGATAACGAC TTCCCCAACA ACATCAGTTG CAGCAATTTC TTCATGTTCT 1800
GTTTGTTCCC CGTCCTCGTG TACCAGATCA ACTACCCAAG AACGTCGCGC ATCAGATGGA 1860
GGTATGTGTT GGAGAAGGTG TGCGCCATCA TTGGCACCAT CTTCCTCATG ATGGTCACGG 1920
CACAGTTCTT CATGCACCCG GTGGCCATGC GCTGTATCCA GTTCCACAAC ACGCCCACCT 1980
TCGGCGGCTG GATCCCCGCC ACGCAAGAGT GGTTCCACCT GCTCTTCGAC ATGATTCCGG 2040
GCTTCACTGT TCTGTACATG CTCACGTTTT ACATGATATG GGACGCTTTA TTGAATTGCG 2100
TGGCGGAGTT GACCAGGTTT GCGGACAGAT ATTTCTACGG CGACTGGTGG AATTGCGTTT 2160
CGTTTGAAGA GTTTAGCAGA ATCTGGAACG TCCCCGTTCA CAAATTTTTA CTAAGACACG 2220
TGTACCACAG CTCCATGGGC GCATTGCATT TGAGCAAGAG CCAAGCTACA TTATTTACTT 2280
TTTTCTTGAG TGCCGTGTTC CACGAAATGG CCATGTTCGC CATTTTCAGA AGGGTTAGAG 2340
GATATCTGTT CATGTTCCAA CTGTCGCAGT TTGTGTGGAC TGCTTTGAGC AACACCAAGT 2400
TTCTACCGGC AAGACCGCAG TTGTCCAACG TTGTCTTTTC GTTTGGTGTC TGTTCAGGGC 2460
CCAGTATCAT TATGACGTTG TACCTGACCT TATGAACTGC CACCATACCA CGTGTGTCCC 2520
TCGCAAGCCC TTGATAGATA TACAATAGGG AATGGGCGTC CGTCCACCGT GGTCAAAGAC 2580
AGGGGCAAAG AGCTCCTAGG T
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Glu Thr Lys Asp Leu Leu Gln Asp Glu Glu Phe Leu Lys Ile
1 5 10 15

Arg Arg Leu Asn Ser Ala Glu Ala Asn Lys Arg His Ser Val Thr Tyr
20 25 30

Asp Asn Val Ile Leu Pro Gln Glu Ser Met Glu Val Ser Pro Arg Ser
35 40 45

Ser Thr Thr Ser Leu Val Glu Pro Val Glu Ser Thr Glu Gly Val Glu
50 55 60

Ser Thr Glu Ala Glu Arg Val Ala Gly Lys Gln Glu Gln Glu Glu Glu
65 70 75 80

Tyr Pro Val Asp Ala His Met Gln Lys Tyr Leu Ser His Leu Lys Ser
85 90 95

Lys Ser Arg Ser Arg Phe His Arg Lys Asp Ala Ser Lys Tyr Val Ser
100 105 110

Phe Phe Gly Asp Val Ser Phe Asp Pro Arg Pro Thr Leu Leu Asp Ser
115 120 125

Ala Ile Asn Val Pro Phe Gln Thr Thr Phe Lys Gly Pro Val Leu Glu
130 135 140

Lys Gln Leu Lys Asn Leu Gln Leu Thr Lys Thr Lys Thr Lys Ala Thr
145 150 155 160

Val Lys Thr Thr Val Lys Thr Thr Glu Lys Thr Asp Lys Ala Asp Ala
165 170 175

Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser Gly Ile Tyr Val Phe
180 185 190

Ala Trp Met Phe Leu Gly Trp Ile Ala Ile Arg Cys Cys Thr Asp Tyr
195 200 205

Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu Glu Ile Val Gln Tyr
210 215 220

Met Thr Thr Asp Leu Phe Thr Ile Ala Met Leu Asp Leu Ala Met Phe
225 230 235 240

Leu Cys Thr Phe Phe Val Val Phe Val His Trp Leu Val Lys Lys Arg
245 250 255

Ile Ile Asn Trp Lys Trp Thr Gly Phe Val Ala Val Ser Ile Phe Glu
260 265 270

Leu Ala Phe Ile Pro Val Thr Phe Pro Ile Tyr Val Tyr Tyr Phe Asp
275 280 285

Phe Asn Trp Val Thr Arg Ile Phe Leu Phe Leu His Ser Val Val Phe
290 295 300

Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Asp
305 310 315 320

Ile Lys Gln Glu Leu Glu Tyr Ser Ser Lys Gln Leu Gln Lys Tyr Lys
325 330 335

Glu Ser Leu Ser Pro Glu Thr Arg Glu Ile Leu Gln Lys Ser Cys Asp
340 345 350

Phe Cys Leu Phe Glu Leu Asn Tyr Gln Thr Lys Asp Asn Asp Phe Pro
355 360 365

Asn Asn Ile Ser Cys Ser Asn Phe Phe Met Phe Cys Leu Phe Pro Val
370 375 380

Leu Val Tyr Gln Ile Asn Tyr Pro Arg Thr Ser Arg Ile Arg Trp Arg
385 390 395 400

Tyr Val Leu Glu Lys Val Cys Ala Ile Ile Gly Thr Ile Phe Leu Met
405 410 415

Met Val Thr Ala Gln Phe Phe Met His Pro Val Ala Met Arg Cys Ile
420 425 430

Gln Phe His Asn Thr Pro Thr Phe Gly Gly Trp Ile Pro Ala Thr Gln
435 440 445

Glu Trp Phe His Leu Leu Phe Asp Met Ile Pro Gly Phe Thr Val Leu
450 455 460

Tyr Met Leu Thr Phe Tyr Met Ile Trp Asp Ala Leu Leu Asn Cys Val
465 470 475 480

Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
485 490 495

Asn Cys Val Ser Phe Glu Glu Phe Ser Arg Ile Trp Asn Val Pro Val
500 505 510

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Gly Ala Leu
515 520 525

His Leu Ser Lys Ser Gln Ala Thr Leu Phe Thr Phe Phe Leu Ser Ala
530 535 540

Val Phe His Glu Met Ala Met Phe Ala Ile Phe Arg Arg Val Arg Gly
545 550 555 560

Tyr Leu Phe Met Phe Gln Leu Ser Gln Phe Val Trp Thr Ala Leu Ser
565 570 575

Asn Thr Lys Phe Leu Arg Ala Arg Pro Gln Leu Ser Asn Val Val Phe
580 585 590

Ser Phe Gly Val Cys Ser Gly Pro Ser Ile Ile Met Thr Leu Tyr Leu
595 600 605

Thr Leu
610

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TATAAAATTC CTTTCATCAA TACATCTATA TATTCGAATA TATAGATAAA CCAATACAAA 60 

AACATACTGA AATTTTTTGA AAACAACTAA AACTATTCAT TGCAGTTACA CGTGAATGCT 120 

AAACTTTATA TCGCTCTTGT CGGTCCCGCG GAGTTAACAT TTAACGGCTT CTCGCGCAAT 180 

AACCGGAAAA ATTCCAACAG TTTCTTTGTA ATATTATTAA GCCTTCTTTT TTCCCGGAAT 240

CTATAAGAGG GGACGAAAAT TAGCCGCTAT TAATTCTGGT ATTGCCACCT AGACAAGAAG 300 

TAAACAGACA CATTACGTTA GCAAAAGCAA CAATAACAAA CACAACCATG GACAAGAAGA 360 

AGGATCTACT GGAGAACGAA CAATTTCTCC GCATCCAAAA GCTCAACGCT GCCGATGCGG 420 

GCAAAAGACA ATCTATAACA GTGGACGACG AGGGCGAACT ATATGGGTTA GACACCTCCG 480

GCAACTCACC AGCCAATGAA CACACAGCTA CCACAATTAC ACAGAATCAC AGCGTGGTGG 540

CCTCAAACGG AGACGTCGCA TTCATCCCAG GAACTGCTAC CGAAGGCAAT ACAGAGATTG 600

TAACTGAAGA AGTGATTGAG ACCGATGATA ACATGTTCAA GACCCATGTG AAGACTTTAA 660

GCTCCAAAGA GAAGGCACGG TATAGGCAAG GGTCCTCTAA CTTTATATCG TATTTCGATG 720

ATATGTCATT TGAACACAGG CCCAGTATAT TAGATGGGTC AGTTAACGAG CCCTTCAAGA 780

CCAAATTCGT GGGACCTACT TTAGAAAAGG AGATCAGAAG AGGGGAGAAA GAGCTAATGG 840

CCATGCGCAA AAATTTACAC CACCGCAAGT CCTCCCCAGA TGCTGTCGAC TCAGTAGGGA 900

AAAATGATGG CGCCGCCCCA ACTACTGTTC CAACTGCCGC CACCTCAGAA ACGGTGGTCA 960

CCGTTGAAAC CACCATAATT TCATCCAATT TCTCCGGGTT GTACGTGGCG TTTTGGATGG 1020

CTATTGCATT TGGTGCTGTC AAGGCTTTAA TAGACTATTA TTACCAGCAT AATGGTAGCT 1080

TCAAGGATTC GGAGATCTTG AAATTTATGA CTACGAATTT GTTCACTGTG GCATCCGTAG 1140

ATCTTTTGAT GTATTTGAGC ACTTATTTTG TCGTTGGAAT ACAATACTTA TGCAAGTGGG 1200

GGGTCTTGAA ATGGGGCACT ACCGGCTGGA TCTTCACCTC AATTTACGAG TTTTTGTTTG 1260

TTATCTTCTA CATGTATTTA ACAGAAAACA TCCTAAAACT ACACTGGCTG TCCAAGATCT 1320

TCCTTTTTTT GCATTCTTTA GTTTTATTGA TGAAAATGCA TTCTTTCGCC TTCTACAATG 1380

GCTATCTATG GGGTATAAAG GAAGAACTAC AATTTTCCAA AAGCGCTCTT GCCAAATACA 1440

AGGATTCTAT AAATGATCCA AAAGTTATTG GTGCTCTTGA GAAAAGCTGT GAGTTTTGTA 1500

GTTTTGAATT GAGCTCTCAG TCTTTAAGCG ACCAAACTCA AAAATTCCCC AACAATATCA 1560

GTGCAAAAAG CTTTTTTTGG TTCACCATGT TTCCAACCCT AATTTACCAA ATTGAATATC 1620

CAAGAACTAA GGAAATCAGA TGGAGCTACG TATTAGAAAA GATCTGCGCC ATCTTCGGTA 1680

CCATTTTCTT AATGATGATA GATGCTCAAA TCTTGATGTA TCCTGTAGCA ATGAGAGCAT 1740

TGGCTGTGCG CAATTCTGAA TGGACTGGTA TATTGGATAG ATTATTGAAA TGGGTTGGAT 1800

TGCTCGTTGA TATCGTCCCA GGGTTTATCG TGATGTACAT CTTGGACTTC TATTTGATTT 1860

GGGATGCCAT TTTGAACTGT GTGGCTGAAT TGACAAGATT TGGCGACAGA TATTTCTACG 1920

GTGACTGGTG GAATTGTGTT AGTTGGGCAG ACTTCAGTAG AATTTGGAAC ATCCCAGTGC 1980

ATAAGTTTTT GTTAAGACAT GTTTACCATA GTTCAATGAG TTCATTCAAA TTGAACAAGA 2040

GTCAAGCAAC TTTGATGACC TTTTTCTTAA GTTCCGTCGT TCATGAATTA GCAATGTACG 2100

TTATCTTCAA GAAATTGAGG TTTTACTTGT TCTTCTTCCA AATGCTGCAA ATGCCATTAG 2160

TAGCTTTAAC AAATACTAAA TTCATGAGGA ACAGAACCAT AATCGGAAAT GTTATTTTCT 2220

GGCTCGGTAT CTGCATGGGA CCAAGTGTCA TGTGTACGTT GTACTTGACA TTCTAAGGCA 2280

TCCTGCAACT GTTCTGTGGA GCTATTAAAT CTTTATAGTA AATTTTTTTT TACTTTTTTT 2340

TTTTTTTTTT TTTTTTTTTA TTATTTACAA GCGTCTATAT ATTTTCTATT ATAGAATATT 2400
GTCATTTATT ACATTGGTTC A
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Lys Lys Lys Asp Leu Leu Glu Asn Glu Gln Phe Leu Arg Ile
1 5 10 15

Gln Lys Leu Asn Ala Ala Asp Ala Gly Lys Arg Gln Ser Ile Thr Val
20 25 30

Asp Asp Glu Gly Glu Leu Tyr Gly Leu Asp Thr Ser Gly Asn Ser Pro
35 40 45

Ala Asn Glu His Thr Ala Thr Thr Ile Thr Gln Asn His Ser Val Val
50 55 60

Ala Ser Asn Gly Asp Val Ala Phe Ile Pro Gly Thr Ala Thr Glu Gly
65 70 75 80

Asn Thr Glu Ile Val Thr Glu Glu Val Ile Glu Thr Asp Asp Asn Met
85 90 95

Phe Lys Thr His Val Lys Thr Leu Ser Ser Lys Glu Lys Ala Arg Tyr
100 105 110

Arg Gln Gly Ser Ser Asn Phe Ile Ser Tyr Phe Asp Asp Met Ser Phe
115 120 125

Glu His Arg Pro Ser Ile Leu Asp Gly Ser Val Asn Glu Pro Phe Lys
130 135 140

Thr Lys Phe Val Gly Pro Thr Leu Glu Lys Glu Ile Arg Arg Arg Glu
145 150 155 160

Lys Glu Leu Met Ala Met Arg Lys Asn Leu His His Arg Lys Ser Ser
165 170 175

Pro Asp Ala Val Asp Ser Val Gly Lys Asn Asp Gly Ala Ala Pro Thr
180 185 190

Thr Val Pro Thr Ala Ala Thr Ser Glu Thr Val Val Thr Val Glu Thr
195 200 205

Thr Ile Ile Ser Ser Asn Phe Ser Gly Leu Tyr Val Ala Phe Trp Met
210 215 220

Ala Ile Ala Phe Gly Ala Val Lys Ala Leu Ile Asp Tyr Tyr Tyr Gln
225 230 235 240

His Asn Gly Ser Phe Lys Asp Ser Glu Ile Leu Lys Phe Met Thr Thr
245 250 255

Asn Leu Phe Thr Val Ala Ser Val Asp Leu Leu Met Tyr Leu Ser Thr
260 265 270

Tyr Phe Val Val Gly Ile Gln Tyr Leu Cys Lys Trp Gly Val Leu Lys
275 280 285

Trp Gly Thr Thr Gly Trp Ile Phe Thr Ser Ile Tyr Glu Phe Leu Phe
290 295 300

Val Ile Phe Tyr Met Tyr Leu Thr Glu Asn Ile Leu Lys Leu His Trp
305 310 315 320

Leu Ser Lys Ile Phe Leu Phe Leu His Ser Leu Val Leu Leu Met Lys
325 330 335

Met His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Gly Ile Lys Glu
340 345 350

Glu Leu Gln Phe Ser Lys Ser Ala Leu Ala Lys Tyr Lys Asp Ser Ile
355 360 365

Asn Asp Pro Lys Val Ile Gly Ala Leu Glu Lys Ser Cys Glu Phe Cys
370 375 380

Ser Phe Glu Leu Ser Ser Gln Ser Leu Ser Asp Gln Thr Gln Lys Phe
385 390 395 400

Pro Asn Asn Ile Ser Ala Lys Ser Phe Phe Trp Phe Thr Met Phe Pro
405 410 415

Thr Leu Ile Tyr Gln Ile Glu Tyr Pro Arg Thr Lys Glu Ile Arg Trp
420 425 430

Ser Tyr Val Leu Glu Lys Ile Cys Ala Ile Phe Gly Thr Ile Phe Leu
435 440 445

Met Met Ile Asp Ala Gln Ile Leu Met Tyr Pro Val Ala Met Arg Ala
450 455 460

Leu Ala Val Arg Asn Ser Glu Trp Thr Gly Ile Leu Asp Arg Leu Leu
465 470 475 480

Lys Trp Val Gly Leu Leu Val Asp Ile Val Pro Gly Phe Ile Val Met
485 490 495

Tyr Ile Leu Asp Phe Tyr Leu Ile Trp Asp Ala Ile Leu Asn Cys Val
500 505 510

Ala Glu Leu Thr Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
515 520 525

Asn Cys Val Ser Trp Ala Asp Phe Ser Arg Ile Trp Asn Ile Pro Val
530 535 540

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Ser Ser Phe
545 550 555 560

Lys Leu Asn Lys Ser Gln Ala Thr Leu Met Thr Phe Phe Leu Ser Ser
565 570 575

Val Val His Glu Leu Ala Met Tyr Val Ile Phe Lys Lys Leu Arg Phe
580 585 590

Tyr Leu Phe Phe Phe Gln Met Leu Gln Met Pro Leu Val Ala Leu Thr
595 600 605

Asn Thr Lys Phe Met Arg Asn Arg Thr Ile Ile Gly Asn Val Ile Phe
610 615 620

Trp Leu Gly Ile Cys Met Gly Pro Ser Val Met Cys Thr Leu Tyr Leu
625 630 635 640

Thr Phe

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGAGCTCA ACTTTCCCCG CTCTCCCCGC ATCCGGAAGC GCTTTCTGCT GCGACGGATC 60

CTTGAGATGC TGTTCTTCAC CCAGCTCCAG GTGGGGCTGA TCCAGCAGTG GATGGTCCCC 120 

ACCATCCAGA ACTCCATGAA GCCCTTCAAG GACATGGACT ACTCACGCAT CATCGAGCGC 180

CTCCTGAAGC TGGCGGTCCC CAATCACCTC ATCTGGCTCA TCTTCTTCTA CTGGCTCTTC 240 

CACTCCTGCC TGAATGCCGT GGCTGAGCTC ATGCAGTTTG GAGACCGGGA GTTCTACCGG 300 

GACTGGTGGA ACTCCGAGTC TGTCACCTAC TTCTGGCAGA ACTGGAACAT CCCTGTGCAC 360 

AAGTGGTGCA TCAGACACTT CTACAAGCCC ATGCTTCGAC GGGGCAGCAG CAAGTGGATG 420

GCCAGGACAG GGGTGTTCCT GGCCTCGGCT TTCTTCCACG AGTACCTGGT GAGCGTCCCT 480 

CTGCGAATGT TCCGCCTCTG GGCTTTCACG GGCATGATGG CTCAGATCCC ACTGGCCTGG 540 

TTCGTGGGCC GCTTTTTCCA GGGCAACTAT GGCAACGCAG CTGTGTGGCT GTCGCTCATC 600 

ATCGGACAGC CAATAGCCGT CCTCATGTAC GTCCACGAAC TACTACGTGC TCAACTATGA 660

GGCCCCAGCG GCAGAGGCCT GAGCTGCACC TGAGGGCCTG GCTTCTCACT GCCACCTCAA 720 

ACCCGCTGCC AGAGCCCACC TCTCCTCCTA GGCCTCGAGT GCTGGGGATG GGCCTGGCTG 780 

CACAGCATCC TCCTCTGGTC CCAGGGAGGC CTCTCTGCCC TATGGGGCTC TGTCCTGCAC 840 

CCCTCAGGGA TGGCGACAGC AGGCCAGACA CAGTCTGATG CCAGCTGGGA GTCTTGCTGA 900 

CCCTGCCCCG GGTCCGAGGG TGTCAATAAA GTGCTGTCCA GTGAGAAAAA GAAAAAAAAA 960 
AAAAAAAAAA ATTCTGCGGC CGC 
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Glu Leu Asn Phe Pro Arg Ser Pro Arg Ile Arg Lys Arg Phe Leu
1 5 10 15

Leu Arg Arg Ile Leu Glu Met Leu Phe Phe Thr Gln Leu Gln Val Gly
20 25 30

Leu Ile Gln Gln Trp Met Val Pro Thr Ile Gln Asn Ser Met Lys Pro
35 40 45

Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile Glu Arg Leu Leu Lys Leu
50 55 60

Ala Val Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe
65 70 75 80

His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg
85 90 95

Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp
100 105 110

Gln Asn Trp Asn Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr
115 120 125

Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Thr Gly
130 135 140

Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val Pro
145 150 155 160

Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gln Ile
165 170 175

Pro Leu Ala Trp Phe Val Gly Arg Phe Phe Gln Gly Asn Tyr Gly Asn
180 185 190

Ala Ala Val Trp Leu Ser Leu Ile Ile Gly Gln Pro Ile Ala Val Leu
195 200 205

Met Tyr Val His Glu Leu Leu Arg Ala Gln Leu
210 215

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGTTGAACT TCATGATGCA TGACCAGCGC ACCGGCCCGG CATGGAACGT GCTGATGTGG 60

ACCATGCTGT TTCTAGGCCA GGGAATCCAG GTCAGCCTGT ACTGCCAGGA GTGGTACGCA 120 

CGGACGCACT GCCCCTTACC CCAGGCAACT TTCTGGGGGC TGGTGACACC TCGATCTTGG 180

TCCTGCCATA CCTAGAGGTC GGGACAGACG ACGCTACCTG CCCAGACACC ACCAAGTTCT 240 

CTGCCTGCAA AACCTGGGGA CCAGGACTTC CTGTCTTGCA TTCCCAAATT TGGGTTCTTG 300 

AGTCGAGGCA ACCTTGCACA CAAGACCCCA CCAAGGGATT GTTGCAAGGG ATTAGATTTT 360

GCAGATTTGT TGGGTAATGA TTCAACGACT CAGCTGGGGG TTGACCAGGG TTGATTTTTC 420 

AATCCTTTTC CCCTGGGTTT GGGTTACAGC TTTTT 455
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu Asn Phe Met Met His Asp Gln Arg Thr Gly Pro Ala Trp Asn
1 5 10 15

Val Leu Met Trp Thr Met Leu Phe Leu Gly Gln Gly Ile Gln Val Ser
20 25 30

Leu Tyr Cys Gln Glu Trp Tyr Ala Arg Thr His Cys Pro Leu Pro Gln
35 40 45

Ala Thr Phe Trp Gly Leu Val Thr Pro Arg Ser Trp Ser Cys His Thr 
50 55 60 
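
The correspondence between a nucleotide listing and its amino acid listing can be spot-checked by translating the DNA with the standard genetic code. The following is a minimal Python sketch, not part of the original specification; the function names (clean, translate, to_three_letter) are illustrative assumptions. It translates the first 60 bases of SEQ ID NO: 9 in reading frame 1 for comparison with the first 20 residues of SEQ ID NO: 10.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter codes, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def clean(seq: str) -> str:
    """Drop spaces, position numbers and anything that is not A, C, G or T."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def to_three_letter(protein: str) -> str:
    """Render a one-letter protein string in three-letter notation."""
    return " ".join(THREE_LETTER[residue] for residue in protein)


# First row (bases 1-60) of SEQ ID NO: 9 as given in the listing.
SEQ9_FIRST_60 = (
    "ATGTTGAACT TCATGATGCA TGACCAGCGC ACCGGCCCGG CATGGAACGT GCTGATGTGG 60"
)

if __name__ == "__main__":
    print(to_three_letter(translate(SEQ9_FIRST_60)))
```

Because clean discards digits and spaces, a row can be pasted directly from the listing, position number included.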



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATGGACAACG CGGGGTCTGA TACGACTCAC TATAGGGAAT TTGGCCCTCG AGCAGTAGAT 60 

TCGGCACGAT GGGCACGAGG ACTCCATCAT GTTCCTCAAG CTTTATTCCT ACCGGGATGT 120

CAACCTGTGG TGCCGCCAGC GAAGGGTCAA GGCCAAAGCT GTCTCTACAG GGAAGAAGGT 180 

CAGTGGGGCT GCTGCGAGCA AGCTGTGAGC TATCCAGACA ACCTGACCTA CCGAGATCTC 240 

GATTACTTCA TCTTTGCTCC TACTTTGTGT TATGAACTCA ACTTTCCTCG GTCCCCCCGA 300 

ATACGAGAGC GCTTTCTGCT ACGACGAGTT CTTGAGATGC TCTTTTTTAC CCAGCTTCAA 360

GTGGGGCTGA TCCAACAGTG GATGGTCCCT ACTATCCAGA ACTCCATGGA AGCCCTTTCA 420

AGAGCTTCTG GCAGTTTTGG AGACCGCGAG TTCTACAGAG ATTGGTGGAA TGCTGAGTCT 480 

GTCACCGACT TTTGGCAGAA CTGGAATATC CCCGTGG 517 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 172 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Asp Asn Ala Gly Ser Asp Thr Thr His Tyr Arg Glu Phe Gly Pro
1 5 10 15

Arg Ala Val Asp Ser Ala Arg Trp Ala Arg Gly Leu His His Val Pro
20 25 30

Gln Ala Leu Phe Leu Pro Gly Cys Gln Pro Val Val Pro Pro Ala Lys
35 40 45

Gly Gln Gly Gln Ser Cys Leu Tyr Arg Glu Glu Gly Gln Trp Gly Cys
50 55 60

Cys Glu Gln Ala Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu
65 70 75 80

Asp Tyr Phe Ile Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro
85 90 95

Arg Ser Pro Arg Ile Arg Glu Arg Phe Leu Leu Arg Arg Val Leu Glu
100 105 110

Met Leu Phe Phe Thr Gln Leu Gln Val Gly Leu Ile Gln Gln Trp Met
115 120 125

Val Pro Thr Ile Gln Asn Ser Met Glu Ala Leu Ser Arg Ala Ser Gly
130 135 140

Ser Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser
145 150 155 160

Val Thr Asp Phe Trp Gln Asn Trp Asn Ile Pro Val
165 170

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Lys Asp Leu Leu Glu Phe Leu Lys Ile Arg Leu Asn Ala Asp Ala
1 5 10 15

Lys Arg Ser Thr Asp Ser Pro Thr Val Ser Glu Val Glu Arg Gly Lys
20 25 30

Gln Glu Ile Glu Ala His Lys Ser Lys Lys Arg Phe Arg Ser Phe Ser
35 40 45

Phe Phe Asp Ser Phe Glu Arg Pro Ser Leu Leu Asp Gly Asn Pro Phe
50 55 60

Thr Thr Phe Gly Pro Val Leu Glu Lys Glu Lys Asn Leu His Lys Lys
65 70 75 80

Lys Thr Thr Val Thr Asp Val Ser Asn Phe Ser Gly Ile Tyr Val Phe
85 90 95

Trp Met Leu Ala Leu Asp Tyr Tyr Gly Glu Ile Leu Tyr Met Thr Thr
100 105 110

Leu Phe Thr Val Ala Asp Leu Met Phe Leu Ser Thr Phe Phe Val Val
115 120 125

Leu Lys Trp Thr Gly Ile Ser Ile Glu Phe Leu Phe Ile Phe Leu Trp
130 135 140

Ser Arg Ile Phe Leu Phe Leu His Ser Val Phe Val Met Lys His Ser
145 150 155 160

Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Ile Lys Glu Glu Leu Ser Leu
165 170 175

Lys Tyr Lys Glu Ser Ser Pro Leu Gln Lys Ser Cys Phe Cys Phe Glu
180 185 190

Leu Gln Phe Pro Asn Asn Ile Ser Phe Phe Phe Phe Pro Thr Leu Ile
195 200 205

Tyr Gln Ile Tyr Pro Arg Thr Ile Arg Trp Tyr Val Leu Glu Lys Cys
210 215 220

Ala Ile Phe Gly Thr Ile Phe Leu Met Met Ala Gln Met Pro Val Ala
225 230 235 240

Met Arg Asn Phe Trp Gln Leu Leu Asp Ile Pro Gly Phe Val Leu Tyr
245 250 255

Leu Thr Phe Tyr Ile Trp Asp Ala Leu Asn Cys Val Ala Glu Leu Thr
260 265 270

Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser
275 280 285

Phe Ser Arg Ile Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His
290 295 300

Val Tyr His Ser Ser Met Phe Lys Leu Lys Ser Gln Ala Thr Leu Thr
305 310 315 320

Phe Phe Leu Ser Ala Val Val His Glu Ala Met Val Ile Phe Arg Tyr
325 330 335

Leu Phe Phe Gln Gln Met Ala Leu Asn Thr Lys Phe Arg Arg Ile Asn
340 345 350

Val Phe Trp Gly Cys Gly Pro Ser Val Thr Leu Tyr Leu Thr
355 360 365

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe His Ser
1 5 10 15

Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg Glu Phe
20 25 30

Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp Gln Asn
35 40 45

Trp Lys Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr Lys Pro
50 55 60

Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Asp Arg Gly Val
65 70 75 80

Pro Gly Pro Ser Ala Phe Phe His Val Val Thr Trp Val Ser Val Pro
85 90 95



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GAGGGGACGA AAATTAGCCG CTATTAATTC TGGTATTGCC ACCTAGACAA GAAGTAAACA 60
GACACAGATG CAAGAGTTCG AATCTCTTAG C 91 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CTATAAAGAT TTAATAGCTC CACAGAACAG TTGCAGGATG CCTTAGGGTC GACTACGTCG 60 
TAAGGCCGTT TCTGAC 76 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CATTGCAGTT ACACGTGAAT GC 22
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TAGCTCCACA GAACAGTTGC AGG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CTCTGACAAC AACGAAGTCA G 21

What is claimed is:

1. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase II.

2. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase III.

3. The isolated nucleic acid of claim 1 or 2, wherein
the nucleic acid is DNA or RNA.

4. The isolated nucleic acid of claim 3, wherein the
nucleic acid is cDNA or genomic DNA.

5. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence as set forth in
Figure 15.

6. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as set
forth in Figure 15.

7. The isolated nucleic acid of claim 2, comprising a
nucleic acid having the sequence as set forth in
Figure 16.

8. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase III having
substantially the same amino acid sequence as set
forth in Figure 16.

9. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence designated Seq.
I.D. No.: 11.

10. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mouse wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as the
sequence designated Seq. I.D. No.: 12.

11. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II.

12. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III.

13. A vector comprising the isolated nucleic acid of
claim 1 or 2.

14. The vector of claim 13 further comprising a promoter
of RNA transcription operatively linked to the
nucleic acid.

15. The vector of claim 14, wherein the promoter
comprises a bacterial, yeast, insect or mammalian
promoter.

16. The vector of claim 14, further comprising plasmid,
cosmid, yeast artificial chromosome (YAC),
bacteriophage or eukaryotic viral DNA.

17. The vector of claim 14 designated YEpAB-ACAT2.

18. A host vector system for the production of a
polypeptide which comprises the vector of claim 14
in a suitable host.

19. The host vector system of claim 18, wherein the 
suitable host is a prokaryotic or eukaryotic cell. 

20. The host vector system of claim 19, wherein the 
prokaryotic cell is a bacterial cell. 

21. The host vector system of claim 19, wherein the 
eukaryotic cell is a yeast, insect, plant or 
mammalian cell.

22. A method for producing a polypeptide which comprises 
growing the host vector system of claim 18 under 
suitable conditions permitting production of the 
polypeptide and recovering the polypeptide so 
produced. 



23. A method of obtaining a polypeptide in purified form
which comprises:

(a) introducing the vector of claim 14 into a
suitable host cell;

(b) culturing the resulting cell so as to produce
the polypeptide;

(c) recovering the polypeptide produced in step
(b); and

(d) purifying the polypeptide so recovered.

24. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase II.

25. A purified mutant acylcoenzyme A: cholesterol
acyltransferase II.

26. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase III.

27. A purified mutant acylcoenzyme A: cholesterol
acyltransferase III.

28. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase II.

29. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase II.

30. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase III.

31. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase III.

32. The oligonucleotide of claim 28, 29, 30 or 31 
wherein the nucleic acid is DNA or RNA. 

33. A nucleic acid having a sequence complementary to 
the sequence of the isolated nucleic acid of claim 
1 or 2. 



34. A method for determining whether a subject known to
have an imbalance in sterol levels has the imbalance
due to a defect in esterification of sterol which
comprises:

(a) obtaining from the subject an appropriate
sample containing a mixture of all of the
subject's nucleic acids; and

(b) determining whether any nucleic acid in the
sample from step (a) is, or is derived from, a
nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase so
as to thereby determine whether the subject's
imbalance in sterol levels is due to a defect
in esterification of sterol.

35. The method of claim 34, wherein the determining of
step (b) comprises:

(i) contacting the sample of step (a) with the
isolated nucleic acid of claim 11 or 12 or
the oligonucleotide of claim 29 or 31
under conditions permitting binding of any
nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes
a mutant acylcoenzyme A: cholesterol
acyltransferase to the nucleic acid or
oligonucleotide so as to form a complex;

(ii) isolating the complex so formed; and

(iii) identifying the nucleic acid in the
isolated complex so as to thereby
determine whether any nucleic acid in the
sample contains a nucleic acid which is,
or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or III.

36. The method of claim 35, wherein the isolated nucleic
acid or the oligonucleotide is labeled with a
detectable marker.

37. The method of claim 36, wherein the detectable
marker is a radioactive isotope, a fluorophore or an
enzyme.

38. The method of claim 35, wherein the nucleic acid
sample is first bound to a solid matrix before
performing step (i).

39. The method of claim 35, wherein the sample comprises
blood or sera.

40. A method for treating a subject who has an imbalance
in sterol levels due to a defect in esterification
of sterol which comprises introducing the isolated
nucleic acid of claim 1 or 2 into the subject under
conditions such that the nucleic acid expresses a
wildtype acylcoenzyme A: cholesterol acyltransferase
II or III, so as to thereby treat the subject.

41. A method for inhibiting wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises transforming appropriate cells from
the subject with a vector which expresses the
nucleic acid of claim 33, and introducing the
transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

42. The method of claim 41, wherein the nucleic acid of
claim 33 is capable of specifically hybridizing to
a mRNA molecule encoding acylcoenzyme A: cholesterol
acyltransferase II or III so as to prevent
translation of the mRNA molecule.

43. A method for inhibiting the wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises introducing the oligonucleotide of
claim 28 or 30 into the subject so as to thereby
inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

44. The method of claim 43, wherein the oligonucleotide
of claim 28 or 30 is capable of specifically
hybridizing to a mRNA molecule encoding acylcoenzyme
A: cholesterol acyltransferase II or III so as to
prevent translation of the mRNA molecule.

45. A method for identifying a chemical compound which
is capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject which
comprises:

(a) contacting a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III with the
chemical compound under conditions permitting
binding between the acylcoenzyme and the
chemical compound;

(b) detecting specific binding of the chemical
compound to the acylcoenzyme; and

(c) determining whether the chemical compound
inhibits the activity of the coenzyme so as to
identify a chemical compound which is capable
of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

46. A pharmaceutical composition comprising the chemical
compound identified by the method of claim 45 in an
amount effective to inhibit acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
and a pharmaceutically effective carrier.

47. A method of treating a subject who has
atherosclerosis comprising administering the
pharmaceutical composition of claim 46 to the
subject.

48. A method of treating a subject who has
hyperlipidemia comprising administering the
pharmaceutical composition of claim 46 to the
subject.



49. A transgenic, nonhuman mammal comprising the
isolated nucleic acid of claim 1 or 2.
FIG. 2A
FIG. 2B
FIG. 2C



ARE2 (6.0 Kb)

ARE1 (2M Kb)



WO 97/43439    PCT/US97/09460



FIG. 2D 
FIG. 2E
FIG. 3A
FIG. 4C Sterol Ester
FIG. 6A
FIG. 6C
FIG. 9
FIG. 10




ARGP1 

RT-PCR product: 
FIG. 11

ARGP1   1 MSNYRGILNWCWMLI 15
hACAT 101 EGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALLI 150

ARGP1  16 LSNARLFLENLIKYGILVDPIQWSLFLKDPHSWPAPCLVIAANVFAVAA 65
hACAT 151 LFILSTLWDYIDEGRLVLEFSl#LSYAFGKFPTWWTWWIMFLSTFSVPY 200

ARGP1  66 FQVEKRLAVGALTEQA GLLLHVANLATILCFPAAWLLVESIT 108
hACAT 201 FLFQHWRTGYSKSSHPLIRSLFHGFLFMIFQIG.VLGFGPTYWLAYTLP 249

ARGP1 109 PVGSLLALMAHTILFLKPFSYRDVNSWCRRARAKAASAGKKASSAAAPHT 158
hACAT 250 PASRFIIIFEQIRFVMKA HSFVRENVPRVLNSAKEKSSTVPI PT 293

ARGP1 159 VSYPWILTYRDLYYFLFAPTIOfELNFPRSPRIRKRFLLRRILEMLFFTQ 208
hACAT 294 VN.QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMK.FAQVFGCF 334

ARGP1 209 LQVGLIQQWMVPTIQNSMKPFKDMDYSRI IERLLKLAV PNHLIWL 253
hACAT 335 FYVYYIFERLCAPL FRNIKQEPFSARVLVLCVFNS ILPGVLILF 378

ARGP1 254 IFFYWLFHSCLNAVAELMQFGDREFYRDWWNSESVTYFWQNWNIPVHKWC 303
hACAT 379 LTFFAFLHCWLNAF AEMIJ^FGDRMFYKI>fl^STS YShTYYRTWNVVVHDWL 428

ARGP1 304 IRHFYKPMLRRGSSKWM..ARTGVFLASAFFHEYLVSVPLR MFRL 346
hACAT 429 YYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYA1AVCLSFFYPVLFVL 478

ARGP1 347 WAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSL11GQPIAVL.MYVHDYYV 396
hACAT 479 FMFFGM AFNFIVNDSRiOCPIWN\nj4WTSLFIX^^ 524

ARGP1 397 LNYEAPAAEA 406
hACAT 525 RRHCPLKNPTFLDYVRPRSWTCRYVF 550